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Just a few years ago, Al was the stuff of science fiction. 
However, powerful new computer chips, cheap data storage 
and the emergence of neural networks — among other 
technologies — have pushed Al to centre stage as a readily 
available tool for business applications. In the last couple 
of years, a growing number of companies have started to 
adopt Al and experiment with how it might reduce their 
operating costs, increase efficiency, grow market share 
and improve customer experience. Potential applications 
of Al range from back-office operations to customer-facing 
chatbots, and investment advisory services. The potential 
benefits are enormous. 





As the value proposition of a new technology gradually 
emerges, the risks and the legal and governance issues that 
it brings with it also need to be properly addressed. In this 
paper we aim to address these issues through an open- 
minded, in-depth examination of Al, from a technology- 
neutral stance. 


Underpinning this paper is an industry-wide survey 
which looks at how far Hong Kong’s banking sector has 
progressed in adopting this technology. We would like to 
thank The Chinese University of Hong Kong for its help in 
devising the questionnaire used in the survey. 
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This is the first in a series of papers looking into Al. It 
examines the technology in detail, shares a number of use 
cases being adopted in Hong Kong and explores some 
of the main findings from the survey. We hope that this 
and subsequent papers will offer Hong Kong’s banking 
sector some useful reference points when considering the 
adoption of Al. 


Lastly, we would like to express our gratitude to the banks 
and institutions that have participated in this paper by 
completing the questionnaire and spending time with us for 
interviews. 


Executive summary 


Among the many new technologies to have had an impact 
on the corporate world in recent years, Al looks to be a 
critical disruptor for the banking industry. Apart from gaining 
traction in the corporate world, Al has been adopted as a 
key national strategy by countries such as China, the US, 
the UK, Canada and Japan’. According to PwC’s Al Impact 
Index’, which looks at 300 AI use cases around the globe, 
the technology could contribute US$15.7 trillion to the 
global economy by 2030, boosting the GDP of individual 
countries by up to 2690. 


What does banking mean to HK? 


: $ ] 70 of the 100 world's largest 


banks operate in Hong Kong 


22 40 % of the 253,100 


financial services jobs in Hong 
Kong are in banking 


HK$270.4B 
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value added - amounting to 18% of Hong Kong’s GDP 


Financial services is one of the pillars of Hong Kong’s 
economy, accounting for 7% of jobs and generating 18.9% 
of the city’s GDP?. Seventy of the world’s 100 largest banks 
operate in Hong Kong, securing its role as an international 
financial centre. Banking alone makes up 40% of all 
financial services employment and 63% of the value that 
financial services add to Hong Kong’s GDP. Maintaining the 
competitive position of the banking sector is of fundamental 
importance to Hong Kong. 


Why is Al important? 
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Reshaping Banking with Artificial Intelligence 


e To discuss potential Al applications in the banking 


sector by sharing tried and tested use cases. 


e Tolookatthe latest developments and implementation 


issues as Al is adopted by the banking industry. 


e To promote and encourage adoption of Al by raising 
awareness of the benefits and risks associated with 
the technology. 





This research paper is targeted at: 


Operational and management-level staff of financial 
institutions, who may not necessarily have in-depth 
knowledge of Al. 


Stakeholders in banking digitalisation, such as: 
- AI and FinTech start-ups; 
- Technology incubators; 


- Regulators. 


The HKMA-PwC research is based on: 


Research into academic and commercial publications 
on Al covering economics, public policy, regulation, 
technology and public use cases. 


A Banking AI Survey conducted in August and 
September 2019 on 168 HKMA-registered banks. 


Deep Dive Interviews with individuals from a start-up, 
two FinTech incubators, a research organisation and ten 
banks. The banks were selected to provide a balance of: 


- Multinational, Chinese and local banks; 
- Large, medium and small banks; 


- Established banks and banks that have been operating 
in Hong Kong for less than five years. 


The research insights presented in this paper have been 
organised as follows: 


Chapter 1 introduces concepts necessary to understand 
Al from both business and technical perspectives. It 
looks at the most important components and technigues 
in current Al development. 


Chapter 2 analyses key technologies enabling the 
adoption of the current wave of Al. 


Chapter 3 looks at the drivers for the adoption of Al in 
the banking industry, along with benefits and potential 
barriers to adoption. 


Chapter 4 provides an overview of Al in Hong Kong’s 
banking industry. It shares popular Al solutions being 
implemented by banks globally and defines where Hong 
Kong is on the spectrum of Al development. 


Chapter 5 considers how to implement Al. 


Chapter 6 looks to the future and recommends ways 
in which different stakeholders can help develop Hong 
Kong into an Al Innovation hub. 


Introduction to Al 


Key Takeaways 


Since the term ‘Al’ was first coined in 1956, its scope has 
evolved from the application of predefined logical rules to 
the performance of human-like cognitive functions. The goal 
of Al is to allow computers to mimic human intelligence so 
that they can learn, sense, think and act in order to achieve 
automation and gain analytic insights. 


To mimic human intelligence, Al applications adopt 
two computation approaches: rule-based and non-rule 
based. Rule-based Al ‘learns’ using pre-defined rules and 
knowledge, and ‘thinks’ by inferring logical causes and 
effects according to ‘if-then-else’ rules. Non rule-based Al 
‘learns’ with machine learning algorithms and ‘thinks’ using 
trained Al models. 





Key enablers of AI development are the huge increase in the 
amount of data available and vastly increased computing 
power in both traditional computers and mobile devices, 
along with the ongoing development of machine-learning 
algorithms. 


Some recent Al developments include natural language 
processing for better data pre-processing, deep learning 
and neural networks for learning complex rules, and 
collaborative learning models for model training. 
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1.1. What do we mean by Al? 


Artificial Intelligence has progressed in fits and starts since the term was first coined in 1956 by the American computer 
scientist John McCarthy. In the 1980s and 1990s, Al was focussed on rule-based ‘expert systems’ that implemented 
predefined logical rules. Since the 2000s, data-driven Al has emerged as the main enabler of technologies. Al has now 
evolved to the point where it can interact and communicate with humans through analytics and automation. It can perform 
human-like cognitive functions (e.g. recognising patterns from data and understanding images, text or speech), as well as 
make predictions, recommendations and decisions. 


In the banking sector, analytics and automation technology have been in place for several years. So how does AI differ 
from earlier waves of automation? To understand this from an industry perspective, we put the “what is Al?” question to 
several banks in Hong Kong. Some of their answers are given below: 


It refers to using mathematical 
methods with predictive and decisive 
intelligence to substitute for humans 
in conducting repetitive tasks and 
seals the gap of various automation 
processes we have been doing over 
the years. 


Al is a new enabler for cost savings. 


It is the technology that enables 
machines to perform business 
activities which humans can do today. 


It is the future of banking, banks 
that fail to master AI will become 
obsolete. 


Compared to rule-based data 
analytics that we have been doing 
for a long time, AI is non rule-based 
business intelligence derived from 
analysing data. 





It is clear from these responses that banks recognise the importance of Al but that individual banks, with their different 
strategic focuses and business functions, may not have a full picture of what Al is and what it does. To aid the in-depth 
discussion in the following sections, we first offer a definition of Al. 
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1.2. A definition of Al 


In a nutshell, the goal of Al is to enable computers to mimic human intelligence so that they can learn, sense, think and 
act, in order to achieve automation and gain analytic insights. 


Input Data/Query 


Recommendations/Actions 





While machines can ‘sense’ and ‘act’ in relation to humans and the environment through user interfaces, sensors and 
robotics, the core of Al is the ‘think’ component — e.g. making predictions, decisions or recommendations. To mimic 
human intelligence, Al applications adopt two computation approaches: rule-based and non-rule based. 
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Rule-based decision making, also known as ‘expert 
systems’, is one of the simplest and most common types of 
Al. It is commonly used by almost all financial institutions. 
It involves translating a fixed set of pre-defined rules and 
knowledge about decision making into a knowledge 
base and computer logic (a chain of rules in an ‘inference 
engine”’). Rule-based Al mimics thinking by inferring logical 
causes and effects according to a chain of ‘if-then-else’ 
rules in its knowledge base. 


Rule-based Al systems became popular in the 1960s, and 
dominated the period from the 1970s to the 1990s, used 
especially for industrial robots performing repetitive tasks. 
Applications were limited however, as it can be very time- 
consuming and expensive to add new rules or knowledge 
to these systems so that they can react to the changing 
business environment. Also, in many scenarios it can be 
very challenging to explicitly define rules in a programmatic 
or declarative way. This led to a demand for, and the 
development of, non-rule-based Al. 


Non-rule-based Al not only reacts and make decisions 
according to a pre-defined set of rules, it can also derive 
extra, meaningful insights by ‘learning’ automatically from 
its inputs. 


Machine learning (ML) is one way to enable the ‘learn’ 
capability in Al. This involves using a set of learning 
algorithms driven by mathematical techniques which allow 
machines to learn from data, instead of being explicitly 
programmed to perform certain tasks. The training process 
uses the learning algorithm to derive relationships between 
data points from training data, which is commonly a subset 
of historical data. The outputs of the training are trained 
machine learning models, which can perform predictions or 
make decisions according to the data patterns observed 
from the input data, or from queries provided by users. 


ML is able to identify subtle data patterns which cannot 
be easily described by humans, and extract insights from 
less structured data. As a result, ML has become the main 
technique driving the current wave of Al applications — from 
call centre voice analysis to autonomous vehicles. 


However, ML in general is data and computing-heavy. 
In order to find data patterns or relationships and make 
accurate predictions, ML needs to process tremendous 
volumes of data. It needs to execute computing-intensive 
statistical techniques and mathematical algorithms for 
model training and for tuning and testing model selection. 


According to a 2018 McKinsey report”, AI and machine 
learning have the potential to create an additional $100 
billion in the finance and IT sectors. 


t An inference engine is a tool used to apply logical rules to knowledge 
assets to deduce new information. 


5 McKinsey (April 2018). Visualizing the uses and potential impact of 
Al and other analytics. https://www.mckinsey.com/featured-insights/ 
artificial-intelligence/visualizing-the-uses-and-potential-impact-of-ai-and- 








other-analytics 








1.3. Data and computation algorithms 


The significant progress of both rule-based and non-rule- 
based Al systems has been driven by two key components: 


1. The huge increase in the amount of data available 


2. The availability of vastly increased computing power in 
both traditional computers and mobile devices, and the 
ongoing development of machine-learning algorithms. 


1.3.1. Data 


Data is vital to the ‘learning’ process of a non-rule-based 
Al system — from model training to testing. Even for rule- 
based Al, experts often perform data analysis and statistical 
hypotheses on sample sets of historical data using statistical 
techniques. Also known as ‘data mining’, this approach 
finds statistically significant data patterns when deriving the 
predefined rules to be programmed into the Al’s knowledge 
base. 


Machine learning and data mining actually share many 
similar statistical models and algorithms. The former 
differs from the latter in that data mining is carried out by 
a knowledge worker on a particular data set with a specific 
goal in mind — namely to discover patterns and extract 
insights from the data set. By contrast, machine learning 
algorithms are carried out by computers automatically or 
semi-automatically. 


Both the quality and volume of training data directly impact 
the predictive accuracy of most Al models. Proper data pre- 
processing, including collection, data profiling, cleansing, 
transformation, and labelling of training data, is critical to 
successful Al development. 


Raw training data can come in various formats: including 
structured data (such as historical equity trading transaction 
records stored in a database), semi-structured data (such as 
social media feeds), and unstructured data - like audio, video 
and still images. ML algorithms require this training data to 
be cleansed and transformed into a defined format that the 
machine can easily read. Speech recognition technologies 
do this, for example, by converting voice recordings into 
machine readable texts that can be used to train a Chatbot. 
To derive insights from data relationships, data scientists 
may also need to assign meanings to training data based 


on domain knowledge, i.e. data labelling. For example, if the 
task is to train an Al to classify scanned image documents, 
data scientists will need to label the type of document (i.e. 
labelled or output data) to each of the scanned images (i.e. 
input data). 


Training ML models requires millions of pieces of labelled 
data. Pre-processing and labelling this data is labour- 
intensive and constitutes most of the work required in the 
ML model training process. As a result, the development of 
machine learning was slow in previous waves of Al. But with 
the recent advances in big data technology (as discussed 
in section 2.1.), large volumes of data can now be efficiently 
collected and processed. 


According to a recent survey conducted by New Venture 
Partners (in which the majority of respondents were financial 
services firms), 21.1% of firms invested more than US$500 
million in Big Data or Al in 2019 (up from 12.7% in 2018). 
Investment in big data infrastructure over the past few 
years has led to the accumulation and analysis of granular, 
detailed data on a very large scale. This data foundation is 
providing the ‘fuel’ to power Al training. 


How does AI derive insights intelligently from the training 
data? This requires advanced mathematical learning 
techniques enabled by efficient computing infrastructure 
and data. 


1.3.2. Computation algorithms 


Non-rule-based computation, particularly machine learning 
algorithms, has become the main way of powering the 
processing intelligence core of Al. This is due to its flexibility 
in enabling the machine to autonomously learn from sample 
data sets and then make predictions without explicit pre- 
programmed instructions. The power of machine learning 
is that models trained by machine learning algorithms 
can also iterate and improve their accuracy as they learn 
from new injections of training data’. This has become the 
foundation of different Al capabilities that mimic the ‘sense’ 
(e.g. computer vision) and ‘act’ (e.g. natural language 
generation) capabilities of humans. 


There are four types of machine learning algorithms: 
supervised, unsupervised, semi-supervised and reinforcement 
learning. 


Supervised learning is a type of machine learning 
algorithm in which humans guide the system by labelling 
the relationship between every variable of input data and 
every variable of output. For example, a supervised machine 
learning model can learn which emails are classified as 
spam by first having a real person go through thousands of 
emails to identify those which are spam and then label them. 
Techniques such as linear regression, logistic regression 
and decision tree classification are used to train supervised 
learning models. 


An unsupervised learning algorithm, by contrast, identifies 
patterns in a set of unlabelled input data automatically. In 
other words, rather than learning the relationships between 
a given set of input and output data, unsupervised learning 
algorithms are designed only to learn whether data patterns 
or structures exist. One of the most common use cases 
is to identify different customer segments with different 
preferences and demographics. 


The third type of machine learning algorithm is semi- 
supervised learning. It is used on data sets which are only 
partially labelled. As it can be costly and time-consuming to 
label large data sets, semi-supervised models are commonly 
applied to real-world machine learning problems. Semi- 
supervised learning combines supervised and unsupervised 
learning algorithms to either infer the pattern or structure of 
input variables, or make predictions about the unlabelled 
data and feed those predictions back to the model as 
training data. 


SI — 
. sc 
- k 2 
> k A 
| N 
IN ee i ‘ey 
w= i o. TA 
A y x { ONA. 
! Na x al 
f A A ~ 
X | í sie ea t y 
7 wn Sd h ; 
y “ J Sey % Y E 
NESNE ay / j 
o “ “A FA 
FM as 
We ” F s+ 


Reshaping Banking with Artificial Intelligence 


The last type of machine learning algorithm is reinforcement 
learning. This determines optimal behaviour based on 
feedback from its environment. Reinforcement learning models 
incentivise machines to optimise behaviour by providing 
positive feedback, or ‘rewards’, which teach the machine to 
strive towards a preferred behaviour. In the absence of pre- 
existing training data, a reinforcement learning algorithm must 
learn to predict the consequences of its actions through trial 
and error. It therefore learns from its own prior experience rather 
than from a predetermined set of examples. Reinforcement 
learning is commonly used to automate the decision-making 
process in interactive scenarios. 


Of all the machine learning types described above, 81% 
of the Al use cases that our survey and research studied 
involved supervised learning techniques. Each bank that 
was interviewed had applied a supervised learning solution 
to at least one use case, making this the most common 
type of Al adopted by our respondents. 


While supervised learning is the most widely used learning 
algorithm, it is also common to see different algorithms 
used in conjunction to make more accurate predictions in 
complex scenarios. 
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Below is an example of how different algorithms work 
together in a complex scenario. 


How do supervised learning, 
unsupervised learning and 


reinforcement learning work 
together in banking operations? 












Most traditional anti-money laundering (AML) systems are 
rule-based and identify a substantial number of suspicious 
transactions. Significant human effort is then required to 
determine whether any of the transactions identified as 
suspicious are in fact false negatives (transactions identified 
as suspicious but which are actually not). Machine learning 
can be applied to identify false negative money laundering 
transactions and save human effort. 


Bank 
Transactions 


Bank 





Training Data Set 


Suspicious 
Transactions 


Suspicious 


Existing AML Transactions 


Rule Engine 


Production Data Set 


Reference: ® 


This problem can be solved with Al. With labelled historical 
bank transactions as training data, banks can use supervised 
learning to train a model to identify whether a transaction 
is a false negative or not. Given the nature of supervised 
learning (where patterns learnt by the model are based 
on historical labelled data), continuous monitoring and 
calibration of the model’s performance is required to ensure 
that its results remain aligned with changes in regulations, 
policies, and bank procedures, and to enable the model to 
identify new patterns that are not available from historical 
data. 


Transactions 


Suspicious 
Transactions 





Reshaping Banking with Artificial Intelligence 


an 





Robot Trainer 


False Negative 






Supervised 


Learnin 
s Not False Negative 






Trained 
Al Model 


Manual review 





SETE False Negative 


by Trained = 


Not False Negative 


Model 


Reporting and 
chain of actions 
following bank 
policies and 
procedures 


The continuous data tagging and monitoring and the 
calibration of the model’s performance are performed by 
a ‘robot trainer’, as explained by some of the banks we 
interviewed. Such activities not only require knowledge of 
data and Al technologies, but also some specific financial 
domain expertise, such as trade finance. The recruitment 
and retention of experts in these areas can be a burden for 
banks. We recommend that banks starting the supervised 
learning journey should not only consider the technologies 
involved, but also the implications for change management, 
organisational structure, staff skill sets and ancillary costs. 


8 Magnim (10 March 2019). Connection between Data Science, ML and Al. https://becominghuman.ai/connection-between-data-science-ml-and-ai- 


dici8d89b0bd 





Clustering Result i Feedback 


Reinforcement 
Learning 
Clusters “ E " l 
: Initial Training Training Using 
ab | Trained AI Model i |: i 
9 : : Legitimate ; 
| | Ud 


Feedback . 
| Bank | i O) Outlier | | ; 
Transactions E—— af te liege Th Anomalies — === 


; | ì Outlier l 
: Outlier ® : : Illegitimate : 
: ‘Transactions: 














Manual 
Review 


Supervised 
Learning 














Banks can also apply clustering and unsupervised 
learning methods without human labelling to identify new 
relevant transaction patterns or features for events such 
as new account opening, loan applications and various 
other banking transactions. With clustering algorithms, 
common transactions are grouped into clusters with 
similar behaviours. These are set as a baseline of normal 
transactions. Anomalies can then be identified as outliers 
that do not have a strong membership in any cluster group. 


When an anomaly-based fraud detection model is trained, 
the system can recognise and automatically notify the user 
of any deviations from the normal pattern. Operational staff 
can then confirm whether the transaction is legitimate. Any 
false alerts can be set as feedback, enabling the machine 
to gain greater accuracy through reinforcement learning. 
Banks that have deployed machine learning-based systems 
report that the false positive rate for flagging money 
laundering risks has been reduced by 20% compared to 
previous systems?. 








9 Faggella, D. (12 December 2018). Bank Reduces Money-Laundering Investigation Effort with Al. https://emerj.com/ai-case-studies/bank-reduces- 
money-laundering-investigation-effort-with-ai/ 
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1.4. Recent Al developments 


Having discussed the importance of data and computation 
algorithms for Al, we next look at how recent Al developments 
have brought the application of these two key components 
to a new level. 


While traditional Al algorithms, particularly machine learning 
algorithms, are robust and effective, they require data 
scientists and domain experts to pre-analyse and break 
down problems into small chunks that can be fed into 
the learning process (with specific algorithms, data and 
features). 


Fortunately, the recent emergence of big data and cloud 
computing (discussed in section 2) has increased Al’s 
autonomy and removed some of the constraints associated 
with the need for human domain experts. As a result, many 
Al use-cases in banking have now become feasible. 


We have summarised below three recent Al developments 
arising from the emergence of big data and cloud computing: 


1. Natural language processing 
2. Deep learning 


3. Collaborative learning models 


1.4.1. Better data pre-processing with Natural 
Language Processing 


Machine Learning can only detect patterns in raw data 
effectively if the algorithm can read and interpret the data. 
Natural Language Processing (NLP) is a branch of Al that 
aids computers to understand natural human language, 
which is usually relatively unstructured structured and 
contains ‘implicit? meanings. Differences in authors’ writing 
styles or cultural backgrounds can also affect how the 
content of natural languages is interpreted. Data scientists 
could manually tag the meaning of each sentence as part of 
data pre-processing, but this would be too time consuming 
with the large volume of training data. NLP can be used in 
conjunction with voice-to-text technology (which transforms 
human voice to machine readable text) and computer vision 
technology (which extracts text from images or videos) 
to pre-process human language and tag the meaning 
automatically before applying a general machine learning 
algorithm to find patterns. 
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NLP uses different techniques to learn and understand 
language, enabling computers to understand not only 
single words and word combinations, but also grammar 
and hence written or spoken sentences. NLP categorises, 
analyses and comprehends the following elements using a 
combination of statistical analysis and rules pre-defined by 
linguistics experts: 


e The language used, such as spelling, grammar, meaning 
and connotations of words 


e The content of the text — namely the message the person 
wants to convey (i.e. logical analysis) 


e The person behind the text, including their style and 
feelings (i.e. sentiment analysis) 


In short, NLP has the ability to recognise linguistic patterns 
and interpret meaning in languages, which it achieves 
through the use of linguistic analysis tools such as part- 
of-speech tagging, automatic text summarisation, and 
entity and relationship extraction. NLP is widely used to 
process large volumes of unstructured text in big data to 
reveal patterns, to identify and tag relevant information 
and to summarise document contents — all with minimal 
human input. 


Traditionally, NLP methods have mostly employed ‘shallow’ 
machine learning models, which use a limited set of input 
data features defined by human experts and based on 
their domain knowledge. This is because some linguistic 
information can be represented in many different ways 
and it can be challenging to derive accurate patterns 
from a multiplicity of features (the so-called ‘curse of 
dimensionality’). With the growing use of deep learning, 
neural-based NLP models have achieved superior 
performance compared to traditional models — without the 
need for task-specific feature engineering. 


NLP can satisfactorily process major languages (e.g. 
English and Mandarin). It can also collaboratively work 
with computer vision and voice-to-text technologies for 
applications and use cases such as real-time translation, 
regulatory requirements analysis, legal document reviews, 
compliance reviews, intent parsing for Chatbots, customer 
call record analysis, and sentiment analysis for social 
listening. 


The mixed language environment in Hong Kong (where 
speakers often use English, Mandarin and Cantonese 


interchangeably) and the relatively small population of 
Cantonese speakers has created barriers for the wider 
local adoption of NLP. Research and development by Hong 
Kong-based technology firms such as Fano Labs and 
organisations such as the Hong Kong Applied Science and 
Technology Research Institute (ASTRI) have addressed 
these challenges in recent years. This has helped put NLP 
centre-stage among Al applications in Hong Kong. 


1.4.2. Complex rules and deep learning 


Traditional Machine Learning usually requires data scientists 
to select key data features for prediction, as it is not efficient 
to run the learning algorithm with too many parameters. 
However, some subtle patterns may thus be missed and 
prediction accuracy may suffer as a result — especially with 
highly unstructured data, such as images or voice. 


Deep learning, a subset of machine learning, represents 
a breakthrough in the field of Al. As the name suggests, 
deep learning ‘learns’ by modelling hundreds of thousands 
of deep neural network layers, inspired by the human 
brain, which can have billions of permutations. Deep 
learning is powerful enough to recognise and learn subtle 
structural patterns from vast amounts of data, enabling 
it to recommend decisions or make predictions. Deep 
learning requires a huge amount of computing power and 
data to train a model with sufficient predictive power. Its 
applications were therefore limited until the big data era 
of the last decade. Since then, deep learning has gained 
substantial traction. 
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Lam, A. (11 July 2019). Fano Labs: Multilingual ASR and NLP. https://www.miotech.com/en-US/insights/article/5d26dd98d0728f0044b21639 





Neural networks (NNs) are the building blocks of all deep 
learning techniques, and are inspired by the structure of the 
brain. The basic unit of computation in a neural network is a 
neuron. A neuron calculates an output based on information 
from other neurons or from an external source. Through the 
hidden layer architecture formed by connecting neurons, 
deep learning extracts high level features and learns 
Classification incrementally with each additional layer of 
neurons. Deep learning algorithms are difficult to explain 
because each hidden neuron represents a non-linear 
combination of all the previous neurons. 


With the explosion of data in the big data era, huge amounts 
of structured and unstructured data — including images, 
voice data and digital footprints — have been captured in 
company databases or on the Internet. Learning algorithms 
trained using big data can discover connections between 
subtle or complex features of data that humans cannot 
easily find or describe logically. 
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The latest advances in unsupervised deep reinforcement 
learning from DeepMind’s AlphaGo Zero research"! show 
that in certain situations Al can be surprisingly powerful, 
even without initially inputting and labelling training data. Al 
algorithms have already demonstrated the ability to outstrip 
the performance of even the most experienced humans 
in speech recognition’, facial recognition, playing games 
such as chess and Got’, and detecting loan or credit card 
fraud. 


One important difference between deep learning and 
traditional machine learning is the performance as the 
volume of data increases!*. Deep learning algorithms do not 
perform well when the data volume is small. With greater 
data volume, deep learning algorithms can outperform 
traditional machine learning algorithms. 


Performance of machine learning technigue vs amount of Data 


Performance 


Amount of Data 


Deep Learning 


Traditional 
Machine Learning 








" Silver, D., & Hassabis, D. 


Li, A. 
googles n-recognition-humans/ 









(18 October 2017). AlphaGo Zero: Starting from scratch. https://deepmind.com/blog/article/alphago-zero-star 


June 2017). Google’s speech recognition is now almost as accurate as humans - 





ng-scratch 





9to5Google. https://9to5google.com/2017/06/01/ 


3 Silver, David, et al. (19 October 2017). “Mastering the Game of Go without Human Knowledge”, Nature 550, 354-359. https://www.nature.com/articles/ 


nature24270 


14 Analytics Vidhya. (8 Apr 
learning-machine-learning/ 








il 2019) Deep Learning vs. Machine Learning. https://www.analyticsvidhya.com/blog/2017/04/comparison-between-deep- 
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Below are some of the common applications of narrow Al enhanced by big data and deep learning in banking: 
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bot assistants. 


The above examples show that deep learning has had 
promising results in processing unstructured data by 
automating the process of extracting features from the 
datasets. Research is being actively conducted to improve the 
efficiency of deep learning algorithms and make them fit for 
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generation 


Natural Language Generation (NLG) is an automated Al 
process that generates custom content by transforming 
structured data into natural language. The aim of the 
algorithm is to convey a message by accurately predicting 
the next word and forming a sentence that is readable by 
humans. 


Early attempts at NLG involved predicting the next word in a 
sentence based on the current word. However, there was a 
high degree of inaccuracy, as this method ignored the wider 
context and the structure of preceding sentences. 


A new deep learning-based NLG approach uses the Long 
Short-Term Memory neural network (LSTM). This involves 
the neural network model building and remembering 
the correlations of multiple words after considering the 
seguences of all the key words in the previous sentences. 
This significantly increases the accuracy of sentences 
generated because the word predictions take context and 
structure into account. Some NLG approaches can further 
predict the customer’s emotional profile based on previous 
conversations, or the content of marketing campaigns. They 
can then personalise the message and style of the language 
generated using a predefined database that includes words 
tagged to reflect different emotional responses. 


Some banks have adopted deep learning based on NLG 
to generate personalised e-marketing materials, and have 
successfully doubled the click rates on digital advertisements 
for credit cards and mortgages loans"*. 





general purpose. More practical applications are expected to 
be enabled by deep learning once banks accumulate enough 
data to meet the heavy data requirements for training deep 
learning models. 


185 Cheng, M. (7 August 2019). JPMorgan Chase has found software that writes better ads than humans can. https://qz.com/work/1682579/jpmorgan- 


chase-chooses-ai-copywriter-persado-to-write-ads/ 








1.4.3. Model training with collaborative learning models 


Conventional machine learning and deep learning models 
are not designed to be trained collaboratively by multiple 
parties. They are also privacy-intrusive, as the training data 
set needs to be stored within a centralised database and be 
aggregated using servers within one company’s data centre. 
The centralised nature of conventional model training has 
limited Al development, as no single organisation has 
access to all available data or computing power. 


Federated Learning, recently adopted by tech pioneers 
such as Google", is a new machine learning approach that 
allows organisations to collaborate on machine learning 
model training without giving away proprietary data. 
Conceptually, federated learning enables a shared machine 
learning model to be trained by multiple machines, based 
on distributed data sets. In practice, both the training and 
data reside in a local machine, but the results are encrypted 
and shared with collaborating machines. 
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Eliminating the need to store all the training data in a 
centralised database addresses growing concerns about 
data privacy, especially in light of new privacy legislation, 
such as the EU's General Data Protection Regulation 
(GDPR)!7. Corporates such as WeBank have expressed 
interest in the development of federated learning, and 
WeBank has contributed its Federated Al Technology 
Enabler (FATE) framework for use by the open source 
community so as to assist other banks and corporations to 
onboard the technology!?. 


Federated learning can also be applied to consumer 
devices, which are potentially powerful machines for 
co-training a model using local data sets. According to 
statistics from Ericsson!?, the total number of smartphones 
being used around the world had reached almost 5.6 
billion as of June 2019. The machine learning process 
could become much faster and cheaper if all the unutilised 
computing power on smart phones was contributed via 
federated learning. 


1$McMahan.B. & Ramage. D (6 April 2017). Federated Learning: Collaborative Machine Learning without Centralized Training Data. https:// 


ai.googleblog.com/2017/04/federated-learning-collaborative.html 





17 General Data Protection Regulation. https://eugdpr.org/ 


18 Federated AI Ecosystem. Collaborative Learning and Knowledge Transfer with Data Protection. https://www.fedai.org/ 





19 Ericsson (18 September 2019). Ericsson Mobility Visualizer: Interactive mobility data app. https://www.ericsson.com/en/mobility-report/mobility-visua 


lizer?f—-1&ft-1&r-1&t-8&s-1&u-1&y-2019,2024&c-1 








29 Patki, N., Wedge, R., & Veeramachaneni, K. (2018). The Synthetic data vault. Data to AI Group. 


https://dai.lids.mit.edu/wp-content/uploads/2018/03/SDV.pdf 





21 Edge computing is a distributed, open IT architecture that features decentralised processing power, enabling mobile computing and Internet of 
Things (loT) technologies. In edge computing, data is processed by the device itself or by a local computer or server, rather than being transmitted to 

















a data centre. https://www.hpe.com/hk/en/what-is/edge-computing.html 
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Apart from federated learning, two other trends are important 
for the future development of Al. These are Synthetic Data 
Creation and Transfer Learning. 


Synthetic Data refers to data that has been mimicked 
to include statistical properties that are similar to those 
contained in real data collected from actual business 
transactions. Since synthetic data is generated, there is no 
limit to the amount of data or scenarios that can be derived. 
This addresses a key challenge to Al development. Synthetic 
data is also not subject to data privacy concerns or modelling 
constraints, such as missing values and non-responses. 


The ability to generate synthetic data is particularly important 
in use cases such as fraud detection, where there may not be 
enough real fraudulent data and related patterns available due 
to a bank’s limited transaction volumes for a particular type of 
fraud or its location. 


Recent research by MIT” shows that the prediction accuracy 
of machine learning models based on synthetic data can be 
just as good as that based on real production data. 


Transfer Learning is another important machine learning 
technigue that allows pre-trained models in a particular 
domain to be used by others as a starting point for developing 
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a new model in a similar domain. This accelerates the whole 
training process and performance of a new and related 
model by eliminating the need to retrain the same model 
from scratch. The insights which a data scientist has spent 
a tremendous amount of time, effort and proprietary data to 
develop can thus be passed on to other teams developing 
a model in a similar domain, resulting in an accelerated 
timeline and reduced effort. This machine learning method 
represents a new way for AI researchers and corporations 
to share knowledge accumulated in machines. 


Efficient data exchange enabled by combining 'edge 
computing? and high speed networks means that recent 
developments in collaborative learning systems hold a lot 
of promise for the banking industry. There have been a few 
successful pilot implementations of transfer learning and 
other collaborative learning in multiple scenarios. These 
include learning mobile user behaviour and training a 
common fraud detection system. Once transfer learning and 
federated learning mature, AI models may be able to learn 
more quickly in a collaborative manner, solve challenges 
for banks, and help the industry by building learning 
models from cross-domain data without compromising 
data security and privacy. This will be particularly useful 
for recently established banks, which will have little or no 
historical data. 








4 








Technology foundations 
that enable Al deployment 


Key Takeaways 


The four technologies that enable AI development are big 
data infrastructure, edge computing & 5G, cloud computing, 
and open source technology. 


Big data infrastructure has delivered a huge boost to Al by 
efficiently storing and pre-processing learning data for Al 
training. 5G has cut down data transmission times and made 
data exchange much more efficient, while edge computing 





allows machine learning to take place on devices like 
smartphones without the need for a cloud. Accessible cloud 
computing is enhancing the total computing power available 
for Al. Open source technologies make contributions from Al 
communities and major tech giants available to all, lowering 
costs and enabling software innovation. 
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The pace of Al deployment has been far from steady. 
While there has been a great deal of government-funded 
research, such as the ‘Al Next Campaign’ carried out by the 
Defense Advanced Research Projects Agency (DARPA), Al 
remained in the confines of academic research for many 
years. Insufficient computing power and a lack of freely 
available data were brakes on Al’s progress. 


In the last few years, however, Al has grown from a simple 
rule-based expert system that implements pre-defined 
logical rules to one that feeds data-driven, decision-making 
applications. This has not only been due to the advanced 
research that has been carried out on the underlying Al 
algorithms, but also to other related technologies. 


This section mainly focuses on the key technology 
foundations that underlie the current and future waves 
of AI development. By keeping track of trends in these 
technologies, banks can better plan for AI deployment. 


2.1. Big Data infrastructure to store 
learning data 


As mentioned in Section 1.3.1, the guality and volume of 
available data are major determinants of Al development. 
The proliferation of computers, mobile phones, smart home 
appliances, Internet of Things (IoT) devices and social media 
means that data volume is increasing at an ever faster pace, 
while data structure has become more complex. According 
to the sixth edition of Domo's Data Never Sleeps report”, 
2.5 guintillion bytes of data are estimated to be created by 
the Internet every day. 


Data scientists have always described big data as having 
five distinct dimensions: Velocity, Volume, Value, Variety 
and Veracity (The Five V's). Velocity refers to the speed at 
which data need to be analysed while it is being generated. 
Volume is the amount of data generated each second. Value 
is shorthand for the value of the data being collected and 
analysed. Variety means the number of data types, which 
are usually unstructured. Veracity is the guality and accuracy 
of the data. To derive meaningful insights, big data needs to 
be processed with advanced analytics — so-called 'big data 
analytics'. 
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Compared to traditional data management tools, Big Data 
analytics infrastructure is typically designed from scratch 
to divide time-consuming data processing tasks for billions 
of records into manageable pieces. It utilises parallel 
computing processors and distributed data storage across 
hundreds of servers to shorten the data analytics process 
from days to minutes. 


In the past decade, banks have made significant 
investment in big data analytics infrastructure for efficient 
data collection, processing and management. This has 
laid a foundation that is generating more ‘fuel’ for training 
machine learning models, resulting in better insights and 
more accurate results. 


2.2. Efficient exchange of data with 
edge computing and 5G 


Efficient exchange of data is another enabler of successful 
Al deployment, especially for real-time Al applications. 
Edge computing technology and 5G cellular networks can 
distribute data with significantly reduced communication 
times for data exchange. 


Edge computing is done at or near the source of data, rather 
than relying on centralised data centres for processing, thus 
reducing communication turn-around times. With edge 
computing, most processes can be done locally on personal 
devices, such as cell phones or self-driving cars because 
the edge devices have sufficient computing power in their 
own right. Some are even equipped with chips specifically 
designed for running Al algorithms. 


Edge computing has a number of benefits that are boosting 
the deployment of Al. It mitigates the problem of latency 
and Internet bandwidth (such as the delay in response time 
in cloud computing) by reducing reliance on centralised 
servers for data processing. It also enhances security and 
data privacy, as local devices can pre-process data and 
only share selected pieces with the cloud. This means that 
wearable devices or sensors have become more ‘intelligent’ 
in that they know what information to keep, what to transmit 
and how to use the information they have. For example, 
an iPhone with Face ID can perform facial recognition Al 
using the device’s own processor without sending sensitive 
biometric information to cloud servers. 
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5G - the fifth generation of cellular network technology — 
is complementary to edge computing. The speed of 5G is 
expected to be 2Gbit /s, a hundred times faster than its 
predecessor 4G*4. There are also significant improvements 
in latency, from 40ms for 4G to 1ms for 5G. Fast internet 
speed and short latency allow multiple edge devices to 
exchange data much more efficiently, and thus to participate 
in machine learning model training collaboratively. 


The deployment of Al has been accelerated thanks to the 
efficient exchange of data enabled by edge computing 
and 4G or 5G high-speed networks. Part of the machine 
learning process can be run directly on edge devices to 
power real-time Al applications, such as facial recognition. 
This allows smart devices to provide insights and predictive 
analysis in real time, which helps to create better customer 
interactions. High efficiency in data exchange also enables 
collaboration in machine learning model training, as it splits 
the training process across many individual local devices. 


2.3. Accessible cloud computing to 
enable Al algorithms 


Cloud computing was a precursor and facilitator of the 
emergence of big data. It refers to the provision of computer 
resources, such as computing power, networks and data 
storage, that are shared by users on an on-demand basis. 
The resources offered by cloud service providers are 
typically powered by massive server infrastructures across 
multiple data centres around the world, made available to 
a huge number of users at low cost due to their economies 
of scale. 


Cloud computing technology has the potential to change 
the way companies do business. According to a PwC 
survey?®, companies that integrate the cloud into their 
business strategy benefit from improved time-to-market, 
flexibility to meet changing customer needs, and ease of 
integrating with existing infrastructure. The same benefits 
apply to Al model training. In almost all the interviews we 
conducted on this topic, banks also saw cloud computing 
as a key enabler for adopting Al. 


Modern ML algorithms are data and computing-intensive. 
The cloud’s scalability in terms of computing power helps 
shorten the training time for Al models by enabling them to 
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process a tremendous amount of data and to run algorithms 
‘in the cloud’. Its on-demand flexibility also lowers the effort 
and lead time required to set up relevant infrastructure. 
These characteristics of cloud services suit financial 
institutions well, as many are still at the early, exploratory 
stages of their Al journey. 


2.4. Improved economics and 
collaboration through open source 
technologies 


Open source technology is another key enabler for 
organisations wishing to deploy Al. This decentralised 
software development model has been adopted by most 
of the technology giants. Microsoft, Google, Facebook and 
Amazon have all made remarkable progress in developing 
Al systems, and they have released much of their work to 
the public for free in recent years. China’s Al giant Tencent 
has also donated its federated Al framework, Acumos, to 
the open source initiative. 


Unlike proprietary software products, which usually have a 
price tag for initial licensing as well as regular upgrades and 
maintenance, open source software products are often free to 
download and modify, thereby lowering the cost of ownership. 
While cost is one benefit, open source also provides a solid 
foundation for innovation. By bringing together large knowledge 
pools and significant expertise?, open source communities 
collectively contribute to the continuous development and 
improvement of software. 


In banking, open source development has provided data 
scientists and in-house technical teams with many handy 
off-the-shelf tools. These include Tensorflow”’ (from Google) 
and PyTorch” (Facebook), which can be used to quickly 
apply the latest machine learning technology to business 
problems. 


Open source software also lowers costs and reduces risks 
for banks in their adoption of Al. The open source ecosys- 
tem widens the adoption of practical use cases, which can 
be a useful reference point for banks wishing to improve ex- 
isting models that they have developed in-house. By riding 
on the core Al technologies (e.g. deep learning algorithms) 
contributed by the open source community, start-ups and 
vendors in the banking industry can reduce development 
costs and focus more on business applications. With cost 
savings and risk reduction, the adoption of Al applications 
in the banking industry can be accelerated. 
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Industry adoption of Al: 
drivers and barriers 


Key Takeaways 


A number of drivers of and barriers to Al adoption were 
identified from our survey (see Appendix B). 


The three drivers of AI adoption identified are the need 
to improve customer experience, the need to stay cost- 
effective, and the need to better manage risk. In Hong Kong, 
82% of customers have experienced at least one problem or 
frustration with their banking experience. Banks can leverage 
Al to improve customer service by providing services like 
personalised recommendations and by delivering better 
customer support. At the same time, Al can also help banks 
to reduce costs through functions such as task automation. 
Finally, Al can simplify the risk management process and 





develop more comprehensive insights into customer risk by 
taking data such as transaction history, market trends and 
customer credit history into consideration. 


Three major barriers to Al adoption were also identified. The 
first is lack of explainability, which means that results or 
recommendations generated by Al are difficult to explain. 
A lack of resources, including Al talent and quality data, 
is another obstacle. Furthermore, as banking is a highly 
regulated industry, banks may face additional costs and 
risks in adopting Al due to the need to comply with Al- 
related regulations. 
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Our survey (see Appendix B) found that Al applications 
are on the increase across the Hong Kong banking sector 
— especially in retail banking. 89% percent of retail banks 
have adopted or plan to adopt Al applications; the total 
amount of capital investment by Hong Kong’s retail banking 
sector is expected to increase by 70% in the coming five 
years. 


Since 94% of banks that have adopted AI say that they will 
use it to shape their corporate strategy, it is important to 
understand the reasons driving this increasing adoption. In 
the following sections, these drivers are categorised and 
explained under three main headings: 


e The need to improve customer experience 
e The need to stay cost effective 


e The need to better manage risk 


3.1. The need to improve customer 
experience 


As many business processes are digitalised and 
streamlined, customers are developing higher expectations 
of products and services. According to PwC’s 2019 Survey 
Virtual Banking: Customers Take Charge — Are You Ready?*9, 
70% of customers surveyed in Hong Kong are interested 
in more personalised and relevant information and tools. 
Below are two of the main expectations of these customers: 


68% 


Savings and 
investment advice 
based on my current 
life stage 


58% 


Personalised 
goal-setting and 
tracking tools 


- 


ML 
ES 





29 PwC (November 2019). Virtual Banking: Customers Take Charge — Are You Ready? https://www.pwchk.com/en/financial-services/publications/2019- 


digital-banking-customer-survey.pdf 
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These results suggest that consumer preferences are 
shifting to personalised digital services that are able to 
combine all their product and servicing needs, including 
touch points such as ATMs and branches, while also 
adjusting to the customer’s different life stages. However, 
not all traditional banks are able to meet these preferences, 
leading to customer frustrations. Reported problems include 
long branch waiting times, poor online experience, and lack 
of any consolidated view of their finances. From the same 
survey, we found that the percentage of customers in Hong 
Kong experiencing at least one problem with their bank was 
82%: 


Customers experiencing at least one problem or 
frustration with their bank 


Hong Kong 


82% 





These frustrations are among the reasons for the rapid 
development of FinTech companies, which are more capable 
of delivering a personalised experience to customers 
through digital solutions such as Al. For example, Alipay, the 
Chinese Fintech giant, has been using Al for credit scoring, 
smart customer support and fraud detection?9, 


According to The Future of FinTech in Hong Kong*", a report 
issued by the Hong Kong Financial Services Development 
Council, FinTech companies are providing millions of 
people with access to financial services for the first time, 
largely due to the increased use of mobile phones and the 
decreased cost of servicing customers. Without the barriers 
of legacy systems to contend with, FinTech companies have 
the agility to define new customer digital journeys using 
emerging technologies, including Al. Traditional financial 
institutions are, therefore, being severely challenged in 
terms of providing customised services and innovative 
products. 
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Banks are recognising that they can raise their 
competitiveness against FinTech companies by better 
understanding their customers' behaviour, preferences and 
product needs. Some are looking to utilise their internal 
customer data and combine it with external data to form 
data partnerships that can generate detailed insights into 
customer behaviour, enabling them to provide personalised 
product recommendations to customers based on customer 
profile, transaction history and other data. This is one of the 
key drivers of their adoption of Al and big data analytics. 


Open banking initiatives are further unleashing the 
possibilities of Al to enhance customer experience. These 
initiatives involve opening up internal bank data and helping 
banks connect and collaborate with third-party service 
providers, such as data or product providers, through an 
Application Programming Interface (API). APIs allow the 
secure sharing of customer-authorised financial data, 
and the distribution of partner-based products to bank 
customers. By also utilising big data analytics, banks can 
design unique, personalised products that enhance the 
customer experience. 


Such initiatives are giving customers a more integrated 
banking experience across business applications, and 
are also enabling innovation and cross-industry business 
collaboration. Recently, a global bank announced a free 
personal financial management tool for the public in 
exchange for data from third-party service providers that 
would help it to enhance the tool’s performance”. By 
aggregating data from separate sources, the bank expects 
to increase consumer engagement and put itself in a better 
position to proactively offer personalised products to suit 
customers’ needs as and when required. 


To remain competitive, there is a need for banks to provide 
a better experience to their customers in order to maintain 
and grow their market share. Al can help banks provide 
more flexible and comprehensive solutions to customers, 
thus creating a more personalised and unique customer 
journey. 















31 FSDC (May 2017). The Future of FinTech in Hong Kong. https://www.fsd 
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3.2. The need to stay cost-effective 


Given an ever-changing market and keen competition, it is 
vital for banks to find faster, more economical, and lower- 
risk approaches that will enhance their cost efficiency and 
overall profitability. This is another strong driver for banks 
to seek help from emerging Al solutions, which make large- 
scale automation of business processes possible. 


After decades of work in digitalising the banking sector, 
use cases focusing on customer-facing and front-end 
operations, such as online banking, are fairly sophisticated. 
However, back-office functions such as risk management, 
compliance and technology have progressed more slowly. 
Most are still people- and paper-intensive. Paperwork 
has always been a big concern, as it is one of the main 
contributors to operating costs. Manual processing is not 
only slow, but also increases inconsistencies and human 
error. 


Against this backdrop, the emergence of the concept of 
‘virtual banks’ has introduced banks to a way of facilitating 
cost-efficiency in their daily operations. This concept was 
welcomed by the Hong Kong Monetary Authority (HKMA) 
in September 2017 when it launched a number of initiatives 
under the tagline A New Era of Smart Banking*.. Both 
financial firms (including existing banks in Hong Kong) and 
non-financial firms (including technology companies) may 
apply to own and operate a virtual bank in Hong Kong. The 
HKMA received over thirty applications, and eight licenses 
were granted in the first two quarters of 2019. 


Virtual banks are acting as a catalyst for new developments 
across the banking sector, both digital and physical. They 
are focussing on hyper-personalisation and other new 
business models that can be realised through the adoption 
of new technologies, such as big data, and by leveraging an 
ecosystem of partners. With the increased adoption of new 
technologies, virtual banks are also able to enhance their 
own cost-efficiency. Without traditional physical branches 
and paper-based processes, virtual banks greatly rely on Al 
and data analytics to fully digitalise the customer experience 
while staying cost-effective. 


The HKMA expects that virtual banks will enhance financial 
inclusion for retail customers and small and medium-sized 
enterprises (SMEs) in Hong Kong. For example, these 
banks should not impose any minimum account balance 
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requirement or low-balance fees on their customers 
according to the HKMA’s Guideline on Authorization of 
Virtual Banks, they should also be able to offer more 
price-competitive products given their lower-cost income 
ratios. When it comes to welcoming SMEs as customers, 
traditional banks have been deterred by onboarding costs, 
but their automated procedures should make this segment 
more attractive for virtual banks. 


In the face of increasing challenges from virtual banks, 
traditional banks are exploring automation opportunities 
to stay cost-effective. However, not every scenario can be 
improved by automation alone. For example, KYC and AML 
checks are time-consuming, as they require humans to read 
through all the documents. This situation cannot be improved 
by using automated solutions alone, as a computer cannot 
‘read’ a document. For some more complicated processes, 
lack of IT integration leads to back-office staff entering data 
manually from one system to another to drive the workflow. 
To streamline business processes, alleviate operational 
costs and reduce the chance of errors, Al has become an 
important option for banks. 


With the help of Al technologies such as natural language 
processing (NLP) and speech recognition, automated 
solutions are now able to handle cognitive processes 
such as reading through a document (e.g. with Intelligent 
Workforce) or listening to a recording (e.g. using voice and 
speech analysis). This is helping banks reduce costs. In 
addition to performing simple cognitive tasks, Al is also 
capable of making more accurate and precise predictions, 
in conjunction with the increasing amount of data sharing 
with business partners being enabled by open API. The 
HKMA introduced its Open API Framework for the Hong 
Kong Banking Sector in July 2018*. This aims to facilitate 
the development and wider adoption of APIs within the 
sector. By the end of its Phase I implementation, twenty 
participating retail banks had made more than 500 Open 
APIs available, offering access to information on a wide 
range of banking products and services®. With Open 
API, all information can be digitalised and understood by 
computers, and the systems involved can be more easily 
integrated to achieve end-to-end automation (including 
data collection, data integration and AI outputs). 


3.3. The need to better manage risk 


Risk management is one of the major concerns of the 
banking sector: a careless mistake or ignorance of basic risk 
management can lead to disaster. However, integrated risk 
management can be a very complex and tedious process, 
as it requires high level accuracy and confidentiality. Al can 
simplify the whole risk management process by taking data 
such as transaction history, market trends and customer 
credit history into consideration, in the process developing 
more comprehensive insights into likely risk scenarios. 


According to PwC’s research on open banking®®, the use 
of alternative data — such as a borrower's employment 
history — enables banks to carry out a more comprehensive 
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credit assessment on each borrower. The availability of 
alternative data through partnerships with other types of 
institutions (non-banks), facilitated by API integration, is 
enabling non-traditional risk modelling, in which alternative 
data can be used for credit scoring and real-time dynamic 
pricing. Put together with Al models and big data analytics, 
this is enabling other aspects of customers and customer 
behaviour to be considered when making lending decisions. 
The loan market can thus be expanded while maintaining 
control over credit risk, as the bank’s ability to consider 
the right variables means that more borrowers are able to 
obtain loans. The graph$$ below illustrates the difference 
when alternative data is considered as part of the credit risk 
assessment process: 


Risk-adjusted market expansion 
By considering external factors, a 
more comprehensive and accurate 
estimated loss rate enables more 
loan approvals 


Pricing efficiency 

Enabled by the lower estimated loss 
rates, borrowers pay a lower rate for 
a fixed term loan than the traditional 
models 


630 640 650 660 670 680 690 700 710 720 730 
Weighted Average FICO Score (FICO Score is the credit scoring method) 


Al can also help with other risks, such as money laundering 
and fraud events. For instance, when banks are going 
through the AML process, data points such as personal 
travel or historical transaction statistics can be used to 
help predict whether customers have any connection 
with potential overseas / multi-country money laundering 
activities. Al can also make more accurate predictions when 
identifying potential operational or legal risks. 








news-and-media/press-releases/2018/07/20180718-5/ 











releases/2019/07/20190731-3/ 





Other types of risk management - such as market, 
regulatory or liquidity risk management — could also be 
improved with more data. Al enables the banking sector to 
focus on analytics and prevent losses in a proactive and 
real-time manner, rather than tackling them afterwards. 
Better risk management is clearly a strong motivator for 
banks to adopt Al. 


34 HKMA (17 July 2018). Open API Framework for the Banking Sector and the Launch of Open API on HKMA's Website. https://www.hkma.gov.hk/eng/ 





KMA (31 July 2019). Open API Framework for the Banking Sector: One year on. https:/Awww.hkma.gov.hk/eng/news-and-media/press- 





38 PwC (June 2018). Opening the bank for a new era of growth. hitps://www.pwc.com/il/he/bankim/assets/2018/Opening9620the9620bank9620for9020 
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3.4. Barriers to Al adoption 


While Al has been widely adopted in retail banking, total 
adoption across the whole banking sector still has room 
to grow. According to our survey, just 48% of the entire 
Hong Kong banking sector has adopted or plan to adopt 
Al applications. Local banks, in particular, are still hesitant 
about applying Al technologies. 


Some of the major barriers to Al application are as follows: 


e Explainability 


- Building trust and safety mechanisms and complying 
with regulations through transparent, explainable 
algorithms 


Our business 
processes require 
face-to-face interaction 
with ee 


Making needed 
operational changes 
is difficult 


Have difficulties in 
analytical techniques 


Results of Al 
applications are not 
robust 


Our institution has 
not established 
formal processes to 
govern and manage 
Al projects 


Customer 
related barriers 





Environment- 
related barriers 










Changing corporate 


es culture is difficult 
Organization- 


related barriers 


Policy-related 
barriers 





Al readiness- 
related barriers 





Al application 
result-related 
barriers 





Our institution has 
not established a 
formal prioritization 
process for Al 
investments and 
projects 


Al governance- 


related barriers 





In the following sections, these barriers will be discussed 
and their implications examined. It is important for banks to 
understand the potential execution challenges so that they 
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e Resources 


- Identifying talent, collaborating and outsourcing to 
implement Al solutions 


- Developing data infrastructure to process Al solutions 
and store data 


e Regulations 


- Establishing governance for ethical Al, data privacy 
and other security issues 


- The changing landscape of regulations required in a 
complex digital world 


Customers do not 
trust Al 





Costs in resolving 
customer doubt in 


Low government- 
provided incentives 





Al solutions 

Likelihood of Insufficient 

success is low support from top 
management 


Obsolete laws 
prevent financial 
innovation based 
on Al 


There is no business 
context to apply Al 





The computing 
power is not 
sufficient 


Al cannot ensure Results of Al Directory 
data security I are E Explainability 


E Resources 
Regulations 


Our institution 

does not have a 
steering committee 
at executive or 
senior management 
level responsible 
for determining Al 
development 


CIO does not have a 
direct reporting line 
to the CEO 


The CIO or a similar 
role cannot clearly 
articulate a vision 
for Al’s role in our 
institution 


can better estimate the time and effort needed to implement 
Al solutions. 


3.4.1. Lack of explainability 


Al represents significant automation opportunities. 
However, as it becomes widely adopted in decision- 
making processes that do not offer clear explanations, it 
also represents a ‘black box’ threat, meaning that it may 
follow biases it has learnt from previous examples of poor 
human judgement. The lack of trust in such decision- 
making, and the lack of guarantees for stakeholders who 
may be affected by these decisions, are major barriers to Al 
adoption. This is true not only in the banking industry but in 
virtually the entire business world, according to PwC's 22nd 
Annual Global CEO Survey”. lt is expected that concerns 
about Al’s trustworthiness will decline as it becomes more 
widely accepted and more records of proof are generated. 
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According to the High-level Principles on Artificial Intelligence 
circular published by the HKMA in November 2019, 
banks should ensure an appropriate level of explainability 
of AI models to all relevant parties. They should build their 
capabilities in understanding, validating and explaining how 
the AI arrives at its recommendations or decisions as soon 
as possible. 


According to the survey, 4996 of financial institutions in 
Hong Kong have faced major challenges in validating the 
fairness and accuracy of their Al models (e.g. biased or 
discriminatory decisions), and 3296 have faced challenges 
in developing and launching AI applications due to a lack of 
understanding of Al models (e.g. black-box risk, adversarial 
attacks). 


84% of CEOs agree that Al-based decisions need to be explainable in order to be trusted 
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Many banks, including those which have already started 
their AI journey, expressed concerns on the part of their 
executive team and board about the difficulty of assessing 
the implementation risks. 


Banks should implement adeguate measures during the 
design phase to ensure a level of explainability which is 





appropriate and commensurate with the materiality of 
their AI applications. After banks define the level of their 
explainability requirements, they need to consider the 
choice of AI algorithms, data processing, additional software 
components and application design at the early stages of 
the design phase. This could be challenging for banks due 
to a lack of accepted standards and existing practice. 


37 PwC (2019). 22nd Annual Global CEO Survey. https://www.pwc.com/mu/pwc-22nd-annual-global-ceo-survey-mu.pdf 





88 HKMA (1 November 2019). High-level Principles on Artificial Intelligence. https://www.hkma.gov.hk/media/eng/doc/key-information/guidelines-and- 


circular/2019/20191101e1.pdf 
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3.4.2. Lack of resources for Al adoption 
Shortage of talent 


As an emerging technology, Al suffers from a shortage 
of expert talent. It is difficult for banks to compete 
with technology companies and FinTech companies to 
recruit the best on the market. Recruiting talent who are 
knowledgeable about both Al and banking is essential 
for Al development, but vacancies remain unfilled. Some 
banks are willing to take on inexperienced individuals and 
train them, but this takes time and resources. This talent 
shortage was ranked as the top challenge to their Al journey 
by some of our interviewees. 


How does your institution develop your current 
Al capabilities? 


Present 


Some banks have chosen to outsource or develop other 
collaborative models to address the talent gap. According 
to our survey, 85% of financial institutions adopting Al 
technologies in Hong Kong have considered appointing 
external IT and technology companies to provide manpower 
resources for their projects. As partnerships are the most 
common approach to building Al capabilities, there is an 
increased demand for partnering with external consultants, 
innovation hubs, incubators and academic institutions. 


Future 
(coming 5 years) 


Ranking Ranking 


Internal research and development 
Talent acquisition 


Working with consultants 


Collaborating with academic institutions 


Outsouring R&D to technology firms 





Although partnering with technology firms can help banks 
manage the talent shortage, it brings with it other risks and 
concerns. 


Al is an emerging technology without established standards, 
meaning that a diverse range of solutions is available. 
However, the proprietary algorithms developed by Al 
companies need continuous maintenance and calibration. 
This is a challenge for banks in terms of managing 
partnerships and intellectual property rights with external 
vendors. These Al companies are usually start-ups, so 
traditional vendor assessment procedures may not apply. 


Shortage of quality data 


The quality of the data used for training and validating Al 
models has a great impact on model performance. It is 
critical for Al models to use production data for training 
and validation before deployment to ensure the algorithm 
is applicable to real life situations, otherwise there is a risk 
of unexpected Al model behaviour. However, banks face 
challenges in collecting machine-readable (internal and 
external) production data for Al model training in Hong 
Kong, for three main reasons: 
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e Insufficient amounts of open public data 


* Concerns about the collection, sharing and use of 
customer data and data privacy 


e The need for significant cleansing efforts for unstructured 
or poor quality data 


As reflected in some of the interviews, there is no well- 
established database for public (or government) data 
in Hong Kong, bringing challenges for the development 
of Al models for some specific use cases. For instance, 
developing deep learning enhanced OCR recognition for 
identity cards needs a vast amount of data in the form 
of copies of real identity cards for training and accuracy 
testing. Banks may be unwilling to use copies of their 
clients’ documents due to data privacy concerns. However, 
acquiring an official training dataset from the government 
can take more than a month, as both parties need to go 
through an internal compliance process. Given the iterative 
nature of machine learning and deep learning, in this case 
the model training period could be unexpectedly long for 
banks - possibly undoing the comparative advantage 
brought by Al solutions. 


The shortage of cross-domain data is even more severe than 
that of public data. The lack of acommon data exchange and 
sharing utility in Hong Kong makes it challenging to obtain 
non-bank datasets, such as customer lifestyle information. 
Even after customer consent has been obtained, a lack of 
comprehensive regulations defining how much personal 
information can be shared makes banks hesitate about using 
such data to avoid potential compliance or regulatory risks. 


Flaws in internal data further aggravate the situation. While 
banks capture huge volumes of information every day, 
much of it is not in machine readable format that can be 
fed directly to the Al model. Converting information into Al- 
friendly formats requires significant cost and time in manual 
labour for data tagging and cleansing. 


Moreover, banks are required to meet regulatory 
requirements on the handling of production data. Banks 
that are relatively mature in terms of Al already have policies 
in place. Many of the banks we interviewed said that 
they have leveraged the risk modelling guideline and risk 
management policies for Al model governance and control. 
However, banks which are new to Al need to accelerate 
the development of their data policies in order to mitigate 
model and compliance risks. 


3.4.3. Legal and compliance challenges 


Banks worry about regulations that may create future costs. 
Other hidden costs generated by the inherent uncertainties 
of new technology also make Al a difficult choice. 


Data privacy and protection requirements 


Like other data users, banks need to comply with the 
Personal Data (Privacy) Ordinance (PDPO). Banks are 
reminded to do so in the Consumer Protection in respect 
of Use of Big Data Analytics and Artificial Intelligence (BDAI) 
by Authorised Institutions circular released by the HKMA in 
November 2019°°. Data privacy legislation overseas, such 
as the EU’s General Data Protection Regulation (GDPR) 
(effective from May 2018), the California Consumer Privacy 
Act (effective from January 2020) and others may also 
impact Al development. For example, Article 29 of the 
GDPR" requires transparency in profiling customers, where 
profiling refers to the evaluation of personal aspects using 
personal data, such as prediction of customer preference or 
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classification of customer segments. Potentially, customer 
segmentation and risk profiling by banks will need to be 
reviewed to ensure that proper safeguards are in place for 
the model to make objective recommendations. Activities 
including ‘data collection’, ‘automated analysis to identify 
correlations’ and ‘applying the correlation to an individual 
to identify characteristics of present or future behaviour’ are 
all subject to the regulation if their purpose is for automated 
decision making. 


Banks may therefore need to consider the following potential 
risks when collecting and making use of customer data: 


e Does the user agreement provide adequate consent to 
banks in relation to privacy rights for Al models? 


e Does customer consent cover biometric customer data 
such as voice, facial patterns, and fingerprints, and other 
personal data used for Al development? 


e Can banks be sure that their current data collection, 
storage, and analytic processes comply with privacy 
regulations and are sufficient to mitigate the risk of cyber- 
attack? 


Uncertainty in ethical Al regulations 


In addition to privacy, there are other concerns relating to 
ensuring that safeguards are in place to protect individuals’ 
freedoms and rights in terms of choices. 


In May 2019, France and Canada” announced a plan to 
co-create an International Panel on Artificial Intelligence 
(IPAI) to “support and guide the responsible development 
of artificial intelligence that is grounded in human rights, 
inclusion, diversity, innovation, and economic growth”. 


A number of bodies are drafting Al ethical guidelines 
which cover these areas. For instance, the Institute of 
Electrical and Electronics Engineers (IEEE) has published 
the first edition of an ‘Ethically Aligned Design’ paper*' that 
discusses ethical principles in implementing Al. The paper 
acknowledges the huge potential impact of Al on society, 
and the importance of addressing the non-technical 
aspects of Al, such as human rights, safety and privacy. 
It sets out eight general principles for achieving ethically 
aligned design, and contains discussions on topics such as 
the legal framework for Al accountability. 
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The European Commission has commissioned a high- 
level expert group on Al to draft ‘Ethics Guidelines for 
Trustworthy Al’*?, which will serve as a starting point for 
discussions around European regulation and legislation and 
will set out a framework for achieving trustworthy Al. The 
guidelines note that Al should be lawful, ethical and robust. 
However, they focus on the ethical and robust issues, and 
offer no suggestions regarding lawfulness. The publication 
states that the guidelines are not intended to replace 
current or future policies and regulations. This to a certain 
extent indicates the relevance of having legal regulations in 
Al development, but also indicates the complexity involved 
in developing clear regulatory guidelines. 


Uncertainty over technology assessment 


A comprehensive technology assessment framework is 
recommended for assessing different Al applications or any 
complementary IT solutions to ensure their security and 
reliability. However, it is often a challenge for banks to assess 


— a 
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the technology risk associated with adopting an Al solution, 
owing to the complexity of the solution and inadequate 
in-house Al expertise. Also, the emerging nature of these 
technologies means that the industry has not had enough 
time to build up a well-recognised standard or practice for 
such a framework, making for increased compliance risks 
when adopting Al solutions. Technology risks aside (e.g. 
cybersecurity), regulatory compliance is a major concern 
for banks when adopting Al. 


Nowadays, more and more popular Al libraries are open 
source and freely available for developers to leverage and 
modify when building their own Al solutions. However, 
should banks feel that managing the technology risks 
associated with open source solutions is a challenge, these 
resources may become an obstacle instead of a driver. It 
is very important for guidelines and industry-recognised 
certification processes to be established to assess and 
govern the use of new technologies. 
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'2 High-Level Expert Group on AI. (8 April 2019). Ethics guidelines for trustworthy Al. https://ec.europa.eu/digital-single-market/en/news/ethics- 


guidelines-trustworthy-ai 
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Examples of Al in 
banking today 


Key Takeaways 


In the banking sector, Al has been applied in both customer 
facing activities and back-office operations. The three most 
popular drivers for adoption of Al are to improve customer 
experience, automation and quality control, and risk 
management. 


Improving customer experience is one of the major drivers for 
the adoption of Al. Banks have used AI to analyse customer 
data so as to offer personalised wealth management 
services, as well as for reading customer information and 
confirming customer identify so as to offer remote customer 
onboarding. 


In automation and quality control, the most common Al 
use cases are cheque processing using optical character 
recognition; helping contact centres through speech-to-text 
and natural language processing; and assisting customers 
using Chatbots and robotic process automation. 





Another common application of Al application is for risk 
management. Al can help banks to better assess risk and 
detect fraud by combining both supervised and unsupervised 
learning models to build detailed individual customer risk 
profiles. In bank operations, Al can help to monitor the 
bank’s IT infrastructure, detecting and preventing potential 
attacks. Lastly, combining Al with alternative data can be 
used to assess credit risk for SMEs without the need for 
traditional credit data. 
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investment made 


In the previous section we explored major drivers of 
investment in Al technologies, which included the need 
to improve quality and customer experience, reduce 
operational costs and strengthen risk management. Some 
barriers to adopting Al technologies were also explored. 


The previously mentioned survey (see Appendix B) also 
examined how the banking sector is coping with the 
technology. In the survey, a total of 27 Al use cases were 
identified. These are either being adopted, being planned 
for, or are being explored. 


The top five Al use cases being adopted by the banking 
sector relate to cybersecurity, client-facing Chatbots, remote 


Significant 


Core Banking Customer 


Account 
Management 


67.00 


and mature Al 
capability in 
place 


48.00 


Customer 
Acquisition 


46.00 










Segmentation & 
Insights 





32.00 







Product 
Innovation & 
Pricing 


23.00 


Collateral 
Management 


16.00 





No Al 
applications 
and no Al 
capacity 
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Call Centre 
29.00 


client on-boarding, biometric customer identification, and 
personalised advertising. As for planned use cases, the 
most attractive for banks include anti-money laundering, 
fraud detection and financial advisory. Overall, the current 
trend for Al adoption is towards the development of risk 
management and customer-facing applications. 


To further illustrate the status of Al use cases in the sector, 
the 27 use cases have been mapped to the 25 banking 
functions shown in the table below. We have also quantified 
the average investment in each of these banking functions 
to create a comprehensive picture of current Al investment 
in the Hong Kong banking sector. 


Corporate 


Channel Functions 


Propositions 


Fraud Detection 
and Prevention 


62.00 

















Technology 
Management 


54.50 


Clearance & 
Settlement 


33.33 


Compliance 
Documentation & 
Reporting 


33.00 


Talent 
Management 


31.00 







Risk Assessment 
& Monitoring 


27.00 


Corporate | Consumer 
Lending | Financing 


26.00 





Investment & 
Securities 


20.33 
Wealth 
Management 
19.50 






Finance & 
Reporting 





13.00 


This section summarises and discusses the three most 
popular areas for Al adoption: customer experience, 
automation and quality control, and risk management. 
For each of these areas, several common use Cases are 
discussed. 


Customer experience is an area that many banks in Hong 
Kong are looking to improve through Al. The most commonly 
adopted use cases are robo-advisors and remote client on- 
boarding. The adoption of Al has helped banks improve 
different banking functions, including customer acquisition 
and wealth management. 


Al can also help banks in automation and quality control. 
Popular use cases are cheque processing, voice and speech 
analysis, Chatbots and intelligent process automation. With 
the help of Al in automation and quality control, banks can 
improve their core banking and corporate functions, as well 
as offering a better customer experience. 


In risk management, Al can help banks better assess risk 
and detect fraud. Use cases identified are machine learning- 
aided transactional risk assessment and fraud detection, 
predictive maintenance in data centres, and credit risk 
assessment for SMEs with alternative data. 


Given that these use cases entail more mature technologies, 
they should be applicable as both a starting point for banks 
in Hong Kong that are just beginning to adopt Al, and as a 
source of ideas for more experienced banks. 
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4.1. Customer experience 
4.1.1. Financial advice with robo-advisors 


Robo-advisor technologies have emerged that offer 
professional investment recommendations to the mass 
public at a reasonable cost. Traditionally, the general public 
have rarely had access to the sort of financial advice provided 
by private bankers, nor have they had the experience to 
conduct financial analysis on their own investments. 


Robo-advisors act as low-cost alternatives by eliminating 
human effort and making investment recommendations 
available to anyone, anytime and anywhere. They utilise 
quantitative and machine learning models which are trained 
by inputting a huge amount of historical market data, with 


the aim of building an optimised and diversified portfolio 
that will maximise returns according to the risk appetites 
of the customer*. They are highly accessible as they are 
online 24/7, regardless of one's physical location. 


Wealth management is one of the main services provided by 
banks in Hong Kong. However, adoption of AI technologies 
in this field is still at a relatively early stage. According to our 
survey, out of 234 Al applications that have been launched 
or are going to be launched by banks in Hong Kong, just 
twelve are for investment and wealth management. Within 
these applications, only two are related to providing financial 
advice, although there is growing interest in applying Al for 
the automation of financial advice. 


Robo-advisors automate the portfolio management process 
as follows”: 


Investors are asked to complete an online guestionnaire that includes 
guestions related to their financial status, investment targets, expected 
returns, investments and risk appetite. 


The robo-advisor platform processes the inputs using proprietary 


algorithms, and makes personalised recommendations for allocating or 
rebalancing assets. 


Investors can adjust their goals and risk appetite at any time. The platform 
will then rebalance the portfolio accordingly. 





3 Tao, L. (20 July 2018). What's your risk appetite? Your robo-adviser has the answer. https://www.scmp.com/tech/start-ups/article/2102405/whats- 


your-risk-appetite-your-robo-advisor-has-answer 








“4 FrankenField J (1 October 2019). What is a Robo advisor? https://www.investopedia.com/terms/r/roboadvisor-roboadviser.asp 
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A Hong Kong-based robo-advisor*® can use machine 
learning algorithms to create a portfolio that contains 
multiple exchange-traded funds (ETFs) catering to Hong 
Kong-based investors, based on their risk appetite. Trained 
by supervised learning using historical market data, a robo- 
advisor selects stocks and other assets based on criteria that 
include customer risk appetite, industry preferences and up 
to 200 other factors. Among these are transaction volume 
and price, fundamentals, macroeconomic factors derived 
from market data, financial statements and analyst reports. 
The portfolio performance is monitored and attribution 
analysis is performed to identify the key factors contributing 
to profit and loss. This is updated to the machine learning 
model for further stock selection**. 


Despite the potential benefits of robo-advisors, the guality 
of their performance is yet to be proved. Comparison of the 
portfolio performances of different robo-advisors shows 
that they do not always yield positive returns”. 


New regulatory frameworks are being developed to impose 
stricter regulations on robo-advisors. In July 2019, the Hong 
Kong Securities and Futures Commissions (SFC) issued 
Guidelines on Online Distribution and Advisory Platforms, 
which includes a chapter on robo-advice. It covers topics 
such as availability of information to customers, client profiling, 
system design and development, supervision and testing of 
algorithms, staff resources and automatic rebalancing. 


The Financial Services Regulatory Authority of Abu Dhabi 
Global Market (ADGM) has also issued a guideline on robo- 
advisory activities. This** explicitly includes an algorithm 
governance section which recommends that the robo- 
advisory model must produce explainable, traceable, and 
repeatable outcomes. However, some robo-advisors have 
adopted sophisticated deep learning models which are 
difficult to explain, while others have adopted explainable 
models for which the investment details have not been 
disclosed for proprietary reasons. 


Scan 


e Use OCR to extract 
info from HKID 

e Use facial 
recognition to 
confirm identity 


Prepare 


e Email address 
e Mobile phone no. 
e HKID 


+5 AOUMON. https://www.agumon.com/ 
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4.1.2. Remote client on-boarding 


The process of opening a bank account often defines 
a customer's future relationship with a bank. As simple 
as it may seem, it often involves paper-intensive manual 
processes. A frictionless process, conducted either through 
a human advisor or digitally, can leave the customer with a 
highly positive first impression of the bank's service quality. 


Remote client on-boarding uses machine learning and Al 
for fraud detection, displacing traditional knowledge-based 
authentication (KBA) methods for customer due diligence 
(CDD). It has the potential to save significant processing 
time and costs by digitalising end-to-end processes. For 
example, one Hong Kong bank claims that the time needed 
to open a bank account remotely using a mobile app can 
be shortened to just fifteen minutes®°. Customers can 
thus spend more time on the actual banking services they 
require, while banks can channel human resources towards 
performing complex banking transactions and higher value- 
added activities. 


According to our survey, customer acquisition is an area 
that Hong Kong banks want to improve with the help of Al. 
More than 27 Al applications have been launched or are 
planned in this area, while another 22 are being explored 
or studied. Over 63% of these applications are related to 
remote client on-boarding. 


Unlike face-to-face account opening, remote client on- 
boarding involves a stringent Customer Due Diligence 
identification process. This new account opening process 
is shown below, during which various computer vision 
technologies (such as machine learning enhanced Optical 
Character Recognition (OCR), facial recognition and voice 
authentication) are applied in the scanning and checking 
process: 





Check (optional) 
Send to the 
back-office for 
further AML/KYC if 
needed 





Finished 
e All documents 
verified and account 
opened successfully 


Complete 
e Personal Information 


AQUMON A RER A 5| E28 ! https://www.aqumon.com/webview?content=news&id=161&lang=zh_cn 








Fund Selector Asia. (9 September 2019). Robo-advisor performance: August 2019. https://fundselectorasia.com/robo-advisor-performance- 


august-2019/.Activities.pdf 





+8 Securities and Futures Commission. (July 2019). Guidelines on Online Distribution and Advisory Platforms. https://www.sfc.hk/web/EN/assets/ 
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Extracting ID information with machine learning enhanced 
Optical Character Recognition (OCR) 


When a user uploads pictures or takes a picture of his or 
her ID document with a mobile device, other information 
on the ID document is also captured for further recognition. 
Machine learning algorithms then help identify the 
expected position of words based on the type of document 
recognised. Optical character recognition (OCR) is applied 
to extract words from the ID document and validate these 
against the user’s input. Machine learning helps enhance 
OCR technology by handling the unstructured, incidental 
images in the user’s uploaded documents with greater 
accuracy. It typically makes use of neural network models 
that can deal with complex backgrounds, noise, lighting, 
fonts and other distractions. 


Verifying customers’ identity using facial recognition 
and biometric liveness detection 


Facial recognition technology can match the customer’s 
face with that of the photo extracted from their ID document. 
Multiple digital images are sometimes required at different 
angles to verify unique security features on the ID card, 
such as microprint, hologram, layout and font®'. The user 
may also be asked to record a 2-3 second selfie video, and 
a face extraction model is used to verify whether the video 
was shot by the user in a live environment. 
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Verifying customers’ identity using voice authentication 
technology 


The customer’s identity can also be verified by what is 
called speaker recognition, or voice authentication. This 
makes use of the unique acoustic features of the voice to 
identify a user. Al-assisted voice recognition authenticates 
individuals more precisely by training on a large amount of 
voice samples. Under the Al approach, voice authentication 
performs better in noisier situations, thereby guaranteeing 
more functional usability and robust security. 


All this information and documentation can then be sent to 
the bank’s back office for further AML or KYC checking, 
as appropriate. This may require checking and vetting with 
external regulatory agencies and law enforcement agencies. 
Al can help to facilitate AML and KYC checking processes, 
with NLP extracting information from scanned documents 
and intelligent process automation (IPA) cross-checking 
information from scanned documents with different data 
sources in real-time. 


The core risk of remote client on-boarding is in validating 
the authenticity of biometric information in order to rule out 
synthesised or faked biometric identity. The availability of 
open-source human image and video synthesis programmes 
online has lowered the entry barriers for fraud, and is giving 
rise to higher risks of identity impersonation. According 
to the Voice Intelligence Report* published by call centre 
software solution provider Pindrop, voice fraud surged 
more than four times from 2013 to 2017 and is continuing 
to increase. 


Legislation such as the ‘Defending Each and Every Person 
from False Appearances by Keeping Exploitation Subject 
to Accountability Act of 2019’, also known as the Deepfake 
Accountability Act, has been proposed to tackle the 
problem of Deepfake videos in the US. Facebook is joining 
forces with Microsoft researchers and academics to run the 
Deepfake Detection Challenge (DFDC), with the intention of 
creating software to detect Deepfake videos*?. 


Despite significant efforts by various parties to defend 
synthetic biometric identity, this could still be a key 
challenge for banks, as counterfeiting techniques are 
constantly evolving in the digital world. To mitigate risk, the 
banking sector need to closely monitor the development of 
Al technologies and establish a comprehensive technology 
risk assessment framework. 


4.2. Automation and quality control 
4.2.1. Cheque processing 


Bank cheques have been and will continue to be widely 
used for financial transactions around the world. Enormous 
volumes of written and computer-printed bank cheques 
are processed and verified manually every day. They often 
need detailed information checking to prevent incorrect 
processing. Considerable operational resources are 
therefore devoted to visually verifying every field in each 
cheque, such as date, signature, and courtesy amounts. 
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To reduce costs, a digital cheque clearing system (i.e. a 
cheque truncation system that involves the digitalisation of 
physical cheques into electronic images for transmission 
between banks) has emerged as a global trend**. With 
the help of this system, cheque images and magnetic ink 
character recognition (MICR)** data (including bank codes, 
bank account and chegue numbers) are captured at the 
collecting branch and transmitted electronically. This can 
be seen as a first generation application of optical character 
recognition (OCR) for chegues, where the system can only 
capture limited information and recognise pre-set computer- 
readable fonts®. 


Unfortunately, significant manual efforts are still required on 
top of this OCR-assisted process. To further reduce cost, 
time and effort, banks have started exploring a solution 
capable of automating the entire process of cheque 
recognition, verification and data entry, even for handwritten 
cheques. This is a cheque recognition solution based on 
OCR and supplemented by deep learning models. 


A comprehensive cheque processing solution typically 
starts with input capturing and pre-processing from mobile. 
The bank then extracts the MICR signature as well as other 
items, and passes them on for identification and recognition 
of the information needed, such as bank information and 
legal amount. All the recognised contents are then sent to 
the database for rule-based verification. The last step is to 
export the verified results for cheque clearing. This high- 
level process map*” is shown below: 
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Al is used to analyse the cheque layout during image 
capturing and pre-processing to identify relevant 
information to be extracted. OCR technology is then used 
to extract text from identified fields in the electronic images. 
Pre-set fonts using MICR are recognised simultaneously, 
identifying the payee account and bank. Next, a deep 
learning neural network model is employed for handwritten 
character recognition, a model which is able to continuously 
learn different styles of handwriting during processing to 
constantly improve its recognition accuracy. 


Unlike in the traditional cheque recognition system, by 
incorporating deep learning into the solution banks are 
able to minimise human involvement because the entire 
cheque, including both computer-printed and handwritten 
fields, can be recognised and transacted automatically. 
With appropriate infrastructure and computing power, this 
automated cheque processing solution is perfect for large 
scale processing, as it takes less than one second to read 
an entire cheque®®. Furthermore, the self-learning nature 
of deep learning models means that recognition accuracy 
improves over time as more data is captured. According 
to one of the interviewed banks, a drastic improvement 
in recognition accuracy from around 65% to 95% was 
recorded after it applied a deep learning model to enhance 
OCR cheque recognition. 


The multilingual environment in Hong Kong further increases 
the variety of characters to be processed, as Chinese OCR 
is significantly more difficult than English. The large number 
of Chinese characters and high textual information density 
make the successful recognition of characters challenging. 
The deep learning model thus requires a huge amount of 
training images, and significant human effort to prepare 
data for supervised learning. This has incentivised banks 
to develop deep learning models or seek local experts for 
training and data preparation to ensure the recognition 
procedure is accurate and applicable to the local market. 


4.2.2. Voice and speech analysis in contact centres 


Call centres have acted as a major service and sales 
channel for banks since their inception in the 1980s?9, 
and still remain a significant channel by which customers 
interact with bank representatives. In the digital era, many 
call centres have been transformed into ‘contact centres’, 
with greater capabilities for handling customer enquiries 
through e-mails, instant messaging applications and 
social media. These interactions with customers generate 
massive amounts of new data every day, both as a result 


of transactions (e.g. waiving of credit card annual fees) 
and activities (e.g. the conversation between customer 
and agent when negotiating a fee waiver). The data points 
containing valuable customer insights (i.e. data stored in a 
structured format, such as databases, or in an unstructured 
format, such as a recorded customer call) are important for 
the bank to identify potential improvements in its operations, 
and to identify new business opportunities. 


Voice and speech analysis is used in call centres to help 
improve their customer service, segmentation and insights 
functions. Over the years, banks have mined the structured 
data they have captured, but have found it costly to utilise 
unstructured data. For example, manual efforts are required 
to listen to and summarise samples of customer call records 
for compliance, investigation and training purposes. It is 
unfeasible to tag and analyse every call, given the significant 
cost of hiring people to listen and summarise. 


In this field, two Al applications have been launched in 
Hong Kong and four are at the planning stage. But there is 
growing interest among Hong Kong’s banking sector, with a 
further eleven applications being studied. 


Common scenarios that involve applying Al-enabled 
conversation analysis at a contact centre include: 


e Voice recognition technology to identify and verify a 
customer’s identity based on his or her voice signature. 


e Speech-to-text technologies that convert customer 
dialogue into text in real time. 


e Natural Language Processing to analyse and summarise 
customer dialogue without the need for human 
involvement. 


e Sentiment analysis, emotion detection and conversation 
pattern analytics, which deduce customers’ intentions 
and preferences. 


e Real-time recommendations for contact centre agents on 
advice to give to the customer. 


* Sales compliance analysis. This can also be conducted 
in a much more effective manner with the support of Al 
technologies. Reports on problematic calls can also be 
generated by tracking various customised metrics. 


To enhance the productivity of contact centre agents, AI can 
be deployed to analyse their performance, supplementing 
traditional contact centre key performance indicators (KPIs) 
such as response time and number of complaints handled. 
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Specific agent performance metrics, including silence 
ratios, speaking speed and interruption rate, can also be 
analysed to generate personalised and real-time feedback 
for each agent. For instance, one bank has applied an 
Al-based solution to analyse the correlation between an 
agent’s seniority, work time and productivity©. 


The data generated from contact centre activities and 
other operational functions of a bank can generate insights 
into ways in which the bank could enhance the cost- 
effectiveness and efficiency of the contact centre. For 
example, one bank recorded a 15% increase in sales quality 
scores and a 10% increase in net promoter scores as a 
result of its ability to accurately analyse every call. This is 
also useful for identifying sales opportunities. 


One key challenge to developing voice and speech analysis 
in Hong Kong is the difficulty of processing Cantonese, and 
the use of mixed languages (e.g. English and Cantonese) 
by many locals. Some technology breakthroughs have 
been made by local FinTech start-ups that are improving 
speech-to-text and NLP accuracy. According to one of 
the technology start-ups we interviewed, their technology 
recognition accuracy now stands at around 80% compared 
to human accuracy. 


4.2.3. Chatbots 


Chatbots are online, virtual conversation agents that allow 
automated responses based on customers’ input, and 
simulate human interaction without the presence of a 
contact centre agent. They have become an increasingly 
popular service tool in places where banks find it hard 
to justify investment in property and labour for physical 
contact centres. Ideally, Chatbots provide faster and more 
consistent responses than a human agent. Human effort is 
also eliminated, since Chatbots handle a lot of repetitive 
questions. 


Natural language processing 
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Chatbots are one of the top three most popular applications 
among the 27 listed in our survey. The banking sector has 
shown widespread interest in this technology, especially 
due to its ability to help automate account management 
in core banking, and customer services in the mobile and 
Internet channels. Seventeen banks have already launched 
or plan to launch their own Chatbot, while 25 are exploring 
this option. 


One of our interviewee banks stated that its Chatbots handle 
more than four million Customer enquiries from Mainland 
China on a daily basis. Most Hong Kong respondents are 
exploring more extensive use of Chatbots, in light of their 
benefits in terms of cost savings and their ability to capture 
customer insights that are valuable for marketing and 
product design. 


By using NLP, a Chatbot can understand what people are 
saying in different languages. By observing human actions, 
the robot also accumulates common in-house processes 
and systems knowledge. With machine learning and rule- 
based logic from an in-house IT knowledge base, the 
Chatbot can learn to diagnose a problem, decide whether it 
is something it can fix and, if not, pass it on to a human IT 
support team. 


A Chatbot can handle speech input if a speech-to-text 
synthesiser is used. It can then process the input to identify 
keywords and key terms, and predict the user’s financial 
intent using natural language processing (NLP). It can 
automatically generate responses by leveraging responses 
that are rule-based or based on the user’s predicted request, 
finding corresponding structured data to form a machine- 
readable response. Finally, it applies natural language 
generation (NLG) to respond. 


Natural language generation 


Analyse P 

D y Identify intent Compose reply 
request 

89 Sestek. ING Bank Call centre Speech Analytics Case Study. https://www.sestek.com/case-studies/ing-bank-speech-analytics-case-study/ 
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If needed, the Chatbot can then integrate with the bank’s 
internal workflow engine to trigger different types of 
transaction requests. More advanced implementation of Al 
can reinforce machine learning by continuously feeding new 
data to the model, improving its performance. 


Banks have also been piloting the deployment of cognitive 
robots for internal customer service helpdesks. Another 
virtual robot can handle simple problems that amount 
to nearly half of all support requests (e.g. email issues, 
password resets and Wi-Fi or network issues). This Chatbot 
is still in the early stages of development in that the services 
it can provide are limited and significant support from 
humans is still very much needed. But by continuously 
learning from, developing, and interacting with customers, 
it becomes able to solve more complicated problems and 
provide better responses. Following training, the robot can 
already understand 87% of queries. Staff from IT support are 
certain that virtual robots and humans can work together, 
enabling humans to focus on providing more complex 
support services*'. 


In an ideal situation, a Chatbot should be able to accurately 
understand a customer’s intentions and provide prompt 
and consistent responses. Incorrect predictions of intent 
can result in confusion and customer dissatisfaction. In 
reality, however, the Chatbots deployed by many of the 
banks we surveyed are still at the development stage, only 
offering basic banking services such as information about 
balances and account details. Chatbots are still augmenting 
human actions rather than replacing them. Their underlying 
limitations, such as a lack of decision-making and research 
skills, require Chatbots to route complex questions to 
human staff. 


4.2.4. Intelligent robotic process automation 


Banks have been automating repetitive operational 
processes by integrating systems since computers 
became commercially available. Alongside the traditional 
system integration approach with heavy reliance on IT 
development, a recent trend has been for system integration 
to utilise general purpose robotic process automation (RPA) 
software. RPA is extremely efficient at handling repetitive, 
structured data automatically. However, with the amount 
of unstructured data growing rapidly, banks are looking to 
ramp up their RPA systems with more advanced versions 
of process automation software, such as intelligent process 
automation (IPA). IPA combines machine learning with RPA 
to deliver powerful tools that mimic human interaction and 
make decisions based on robotic outputs. It produces 


automation capabilities that heighten business value 
and competitive advantages. With less need for human 
intervention, IPA can free staff from mundane activities so 
they can focus on processing higher risk or higher value 
cases. 


Most banks are attempting to leverage technologies to 
automate their operations, particularly for clearance, 
settlement and compliance documentation and reporting. 
According to our survey, ten Al applications are already 
in place among banks in Hong Kong to perform these 
corporate functions, showing that the adoption of Al 
technologies in these areas is relatively mature compared 
to other functional areas. According to BIS Research, the 
finance and banking sector is expected to occupy around 
40% of the global cognitive robotic process automation 
market by 2026°. Some banks have already adopted 
IPA to support or automate document processing, trade 
surveillance, compliance activities and customer services. 


Here are some common applications of IPA: 
Trade finance documentation 


Trade finance transactions usually involve large amounts 
of documentation from different parties, such as the 
financial records of the borrower, sales orders or invoices, 
trade documentation such as bills of lading and customs 
declarations, along with other compliance and safety-related 
documentation. Trade finance relies on the authenticity of 
the documentation, and hence extensive vetting and cross- 
checking by the bank is required. However, owing to a lack 
of international standards for global trade documentation, 
rule-based automation cannot be applied. 


IPA utilising deep learning-based computer vision and 
other machine learning techniques can be used to extract 
information from scanned trade documents. Other machine 
learning algorithms can be used to classify the document 
type by clustering documents that share similar document 
elements or features (e.g. procurement orders or invoices). 


Based on the document type identified, cognitive agents 
can make use of Al-enhanced OCR (optical character 
recognition), HCR (handwriting character recognition) and 
NLP to extract trade information from the text. Document 
information captured can then be processed, validated, and 
reconciled (e.g. by the matching of procurement orders and 
invoices), and corresponding workflows can be triggered 
automatically using pre-defined rules. 
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Some banks®™ have already developed Trade Al Engines 
using NLP and machine learning-enhanced OCR to classify 
and process millions of pages of unstructured trade 
documents, digitising them into machine-readable format 
in order to generate insights into the networks of related 
parties and customer trade activity. 


Cybersecurity 


Banks need to safeguard their systems 24/7 to prevent 
system outages or data losses caused by cyber-attacks, 
so they need to perform regular cybersecurity incident 
simulations and drills to discover cyber threats and 
investigate security risks across the whole IT infrastructure. 
Such comprehensive simulations and investigations are 
labour intensive. 


IPA can be deployed to mimic users’ behaviour, including 
how users interact with systems daily and how they react 
to possible cyber-attacks, in order to discover security 
loopholes in an application, IT infrastructure or operational 
process. IPA can supplement the work of humans on 
large-scale incident simulations which traditionally involve 
hundreds of operational staff. The same simulation practice 
assisted by IPA could be applied in advance before rolling 
out new releases of applications to help banks identify and 
prevent a zero-day attack (i.e. an attack on a vulnerability on 
the day of its discovery). 


Regulatory change management 


Given the complexity of many new or newly updated 
regulations and the importance of implementing them 
correctly, compliance teams in banks have traditionally 
used a manual change management process. This includes 
reading and analysing the regulations, identifying the 
implications, and updating corresponding internal policies 
and procedures to implement the regulatory changes. 


Some banks are exploring the use of RPA to monitor 
regulatory changes published on regulators’ websites, and 
of IPA to analyse the regulatory documents and identify 
changes. IPA can identify which banking lines and functions 
will be impacted by regulatory changes, and trigger a 
corresponding workflow to handle the impact assessment 
and change management processes. The HKMA is studying 
the use of machine-readable regulations under its Smart 
Banking initiatives to support banks’ regulatory compliance 
processes®, 
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4.3. Risk management 


4.3.1. Machine learning-aided transactional risk 
assessment and fraud detection 


Traditionally, banks use rule-based systems with predefined 
transaction patterns to identify potential money laundering 
activities. In cases of complex transactional behaviour and 
patterns, the volume of false positive alerts identified by this 
approach increases, requiring significant time to investigate 
alerts and filter false alarms. One recent advance is to apply 
machine learning to the detection of money laundering’. 


Machine learning models, trained by customer and 
merchant data from various internal and external data 
sources, are used to understand business activities and 
complex transactions. The trained models can be deployed 
to scan through all the suspicious transactions identified by 
the AML solution® to differentiate false positive alerts from 
transactions that really require investigation. In order to 





continuously improve the model, new data patterns and the 
results of investigations by humans on identified suspicious 
transactions are looped back to the machine learning model 
for smarter future recommendations. 


By combining both supervised and unsupervised learning 
models, detailed individual customer risk profiles can 
be built. Every transaction can then be scored based on 
the corresponding level of fraud or money laundering 
risk. Such risk prediction input has been widely adopted 
to enhance existing AML decision engine rules in use 
cases such as e-commerce transactions, mobile banking, 
loan applications and recurring banking payments. For 
instance, real-time AML assessment of customer fraud 
risk based on a customer’s risk profile can be conducted 
during the customer on-boarding process. When higher 
risk is predicted or not enough information is available for 
decision-making, additional customer-specific follow-up 
questions can be triggered to protect the bank from the risk 
of fraud. 


8$ Twomey,N. (19 December 2018). Corporate Compliance Insights. 5 Ways Al is Impacting AML and KYC Compliance. https://www. 
corporatecomplianceinsights.com/5-ways-ai-is-impacting-aml-and-kyc-compliance/ 
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Banks have a long history of applying data analysis for 
fraud detection and prevention, making this use case 
popular among banks. In our survey, it ranked as the top 
area for which banks would like to apply Al. Currently, 36 
applications have already been launched, 23 are being 
planned, and 94 are at the study stage. The survey also 
shows that most of these applications focus on Anti-Money 
Laundering (AML) related issues. 


One area where Al technology is already being applied is 
in the real-time monitoring of fraudulent activities relating 
to credit card payments. Machine learning models for 
fraud detection are trained using large historical credit card 
transaction data sets. Using actual fraud cases that have 
been reported and verified, banks can tag historical data 
with relevant fraud and non-fraud labels to train classification 
algorithms. The trained model can then identify high risk 
suspicious transactions with pre-determined features 
indicating fraud. These range from transaction times 
and locations and other spending behaviour statistics to 
the historical transaction patterns of card holders. Geo- 
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location data is one pre-determined feature, as fraudulent 
transactions commonly occur far away from where card 
holders are located at the time of purchase. 


Banks also apply clustering and other unsupervised 
learning methods to identify new relevant transaction 
patterns or features without human labelling®’. Clustering 
algorithms group common transactions into clusters with 
similar behaviours, and these are set as the baseline for 
normal transactions. Anomalies are then identified as 
outliers, lacking strong membership of any cluster group, for 
additional review. Human operations staff can then confirm 
if these transactions are legitimate, and at the same time 
the results can be fed back to the learning model to improve 
accuracy through reinforcement learning. One bank we 
interviewed claimed that its false positive rate for flagging 
laundering risk was reduced by 20% after a machine 
learning-based system was deployed to incorporate newly- 
found patterns into the original AML solutions. 


87 Westerlund, H.S. (25 October 2018). The big Challenges Faced in KYC Implementation Today. https://blog.instantor.com/the-big-challenges-faced- 


in-kyc-implementation-today 








54 





4.3.2. Predictive maintenance in data centres and 
operations risk 


In the digital era, Keeping data centres up and running 
is among the most critical of a bank's missions. A host 
of applications, from online banking to trade settlement 
and from accounting to risk management, all rely on data 
centres providing uninterrupted around-the-clock service. 
However, banks are finding it increasingly challenging not 
only to maintain, but also to expand their data centres, given 
the massive scale and complexity of banking systems. It is 
also challenging for their IT departments to keep up with 
the rapidly changing business environment. As customers 
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According to our survey, technology management is one of 
the top banking functions that banks are choosing to invest 
in. Eighteen applications are now live and 16 are planned. 
Within technology management, cybersecurity has the most 
applications, with 13 launched, seven planned and 14 under 
study. An intelligent identification system can help enhance 
and improve technology management accordingly. 
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server abnormalities 


infrastructure issues 


expect ever faster turnaround of products and services, 
bank IT departments have been busy scaling up their 
infrastructure, enhancing cybersecurity and ensuring that 
systems remain stable and robust. 


AlOps (Al for IT Operations) greatly improves the availability 
and the consistency of performance of bank production 
systems. Hidden server abnormalities can be discovered 
by Al in advance through operation and maintenance log 
analysis, and the relevant IT team can then intervene and 
proactively respond to potential production or security 
issues. With Al-driven proactive alerts, the data centre 
operation model for faults can be transformed from passive 
firefighting to active fault prediction. 


Improve system 
performance consistency 


Better determine 
disaster recovery plan 


Resolve server issues and 
optimise server utilisation 


Enhanced threat 
detection and prediction 


AlOps has emerged among large corporations and financial 
institutions. It does not rely on human-specified rules, 
but uses machine learning algorithms to automatically 
process huge amounts of data generated by servers and 
applications. These include server logs, application error 
event logs, and sensor data within data centre premises. In 
a typical bank IT environment, with hundreds of application 
servers, there could be tens of thousands of server event 
trends or patterns, which are difficult for humans to identify. 
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Some banks have noted that adopting AlOps has helped 
to reduce the time needed to locate IT infrastructure issues 
and analyse problems and root causes, greatly improving 
IT service levels for emergencies and critical events®. In 
addition to proactively predicting and resolving server 
issues, AlOps can help to identify performance bottlenecks 
and optimise server utilisation by routing workload 
intelligently so as to reduce downtime in operations, data 
delivery delaysê* and power consumption”®. 











Model-based simulations are also performed by Al to 
optimise the data centre's server and network eguipment 
arrangements. Using simulations, the extent to which 
equipment changes may disrupt operations can be predicted, 
thus assisting the IT operation team to plan ahead and 
determine an optimum maintenance schedule. Some data 
centres even use Al-enabled predictive analytics to perform 
disaster recovery (DR) drill simulations to supplement their 
actual regular DR drills, which are costly, time-consuming 
and require significant levels of participation from IT and 
operations staff”. 


AlOps could also be used to detect and predict threats 
and other potentially malicious activities. It is very hard for 
conventional systems to keep up with all the new malware 
being created. However, banks can train Al with patterns of 
existing malware in order to detect future potential malware. 
Al can even be trained to detect the smallest signs of virus 
or malware attacks before they enter banks’ systems, so 
that they can be isolated in advance. 


In short, AlOps uses predictive analytics powered by Al 
machine learning to forecast and proactively prevent server 
issues, simulate different data centre scenarios, recommend 
changes and automatically optimise server workload and 
utilisation. All of these outcomes are helping increase the 
performance of IT infrastructure operations while reducing 
human effort. 
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4.3.3. Credit risk assessment for SMEs with alternative 
data 


Established banks have adopted the credit history 
assessment principle when estimating a potential 
customer’s default risk. Statistically, the longer the positive 
history of lending and repayment, the less risk arises for 
a particular credit transaction. These statistical models for 
credit risk assessment and the associated credit control 
framework have helped banks manage credit risk effectively 
in the past. 


To address the challenge of banks not having enough in- 
house data relating to credit history to produce statistically 
robust predictions for particular market segments, such as 
SMEs, the Commercial Credit Reference Agency (CCRA) 
was set up in Hong Kong in 2004. Its role is to collect SME 
credit information (including credit limits, indebtedness and 
credit history) and share this information (with the SMEs’ 
consent) with members of the Hong Kong Association 
of Banks (HKAB) and the Hong Kong Association of 
Restricted Licence Banks and Deposit-taking Companies 


(The DTC Association). As of 2017, the CCRA had collected 
over 176,000 credit records from 119,000 SMEs, sole 
proprietorships or partnerships with turnover of up to 
HKD50 million. The database was further expanded from 
December 2017 to cover non-listed limited companies with 
an annual turnover of up to HKD100 million’. 


While the CCRA provides a trusted source for building 
credit score models, a large proportion of SMEs still remain 
uncovered, given that there are 340,000 SMEs operating in 
Hong Kong”. These are uncovered either because of their 
brief operating history or because the owners have supplied 
incomplete or overly complex financial information. This 
shortfall of data has therefore excluded them from accessing 
loans under the current credit risk assessment framework. 


However, there are statistical ways to identify the correlation 
between a customer’s non-financial transactions and 
behaviour and their credit default risk. This can include 
factors such as demographics, expenses and payment 
transactions and lifestyle data. This is termed ‘alternative 
data’. 











Traditional credit data 










Alternative credit data 


Traditional credit data 
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Financial institutions in mainland China have successfully 
adopted machine learning to build a credit assessment 
model using alternative data. The People’s Bank of China has 
established a Credit Reference Center” with a centralised 
database of credit data for financial institutions to perform 
risk assessments. Financial institutions are also utilising 
other transaction data and demographic information to build 
a machine learning model with the potential of partnering 
with third-party payment platforms. 


Following the rapid expansion of e-wallets and other P2P 
platforms, a significant number of retail and business 
payments are now conducted on third-party online payment 
platforms such as Alipay and WeChat. FinTech is rapidly 
growing as an add-on service to these platforms where 
the transaction records can be used as alternative data 
for credit assessment. According to iResearch Consulting 
Group”, FinTech now accounts for nearly 45% of third- 
party online payment transactions, reaching CNY 29.1 
trillion in 2018. One of the interviewed banks stated that 
it had built a machine learning model with multiple layers 
made up of more than 1,000 variables derived from around 
200 attributes directly obtained from customers. The bank 
recorded a 20% improvement in the performance of its 
SME lending and its default rate dropped by 50% after the 
Al-enhanced model was applied. 


According to our survey, most banks have yet to apply Al to 
credit scoring as a way of supplementing traditional credit 
risk assessment. Only two applications have been launched, 
with three planned. However, banks are showing active 
interest in this area. Fourteen banks indicated that some 
applications are being studied, with reference to a number 
of successful implementation cases of Al-enhanced credit 
scoring outside Hong Kong. 


One obvious way to increase the performance of credit risk 
models based on alternative data is to increase the volume 
and types of variables feeding into the model. Federated 
learning, a new trend in Al, was adopted by one bank that 
we interviewed to securely learn from the encrypted data 
captured on electronic invoices, also known as ‘e-Fapiao’. 
These electronic invoices are shared by organisations 
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centrally in China to improve KYC assessment and 
credit risk measurement of SMEs. Raw data from invoice 
centres is encrypted and aggregated before being shared 
with banks, and banks can then use that data as training 
variables to improve their models, while the data itself 
remains encrypted. This ensures each partner remains the 
owner and retains control of their data without disclosing 
unnecessary data, while banks can still train the model!?. 


Besides introducing alternative data into the core credit risk 
assessment process, Al is also being adopted for wider 
credit risk management practices. For instance, ING has 
worked with Google and PwC” to create an Early Warning 
System (EWS) for credit risk using AI technologies. This is a 
good example of how AI can complement the already mature 
credit risk analytical framework. The EWS is designed to 
collect large amounts of structured and non-structured data 
(e.g. real time news feeds) into the big data infrastructure, 
then use Google's tools, such as NLP and machine learning 
models, to process and prioritise data that may impact on 
a customer's credit risk, which credit analysts can then 
further analyse. This enhances the performance of credit 
risk management analysts by helping them to handle large 
amounts of data in different structures and languages as 
well generating early warnings about particular customers 
or loans, prompting decisions and actions for managing the 
risk exposure. 


The discussion above indicates that, although banks’ 
credit risk management is mature and makes extensive 
use of data analytics, Al still has the potential to add 
value. While federated learning has proven promising for 
improving the performance of AI based credit assessment, 
it faces computing challenges because the underlying 
data encryption increases training times by a factor of 
hundreds or even thousands. Some technology start-ups 
are working on projects to overcome part of this problem 
with specialised hardware and technology architecture”. 
Federating learning is expected to be widely adopted 
once the technology becomes more mature, and as data 
collaboration alliances or partnerships are formed among 
banks and owners of alternative data sources. 
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(Disclaimer: The information provided in this section is solely contributed by PwC and is for general reference only. None of the contents in this part 
represent suggestions by or requirements of the HKMA. This chapter does not constitute legal advice and should not be treated as such. Banks should 
observe relevant legal and regulatory requirements at all times. In case of doubt, professional advice should be sought where appropriate.) 


Al implementation 


Key Takeaways 


To help organisations better implement Al solutions, PwC 
has developed a suggested high-level Al implementation 
journey of six major stages: Strategy, Planning, Ecosystem, 
Development, Deployment, and Operate & Monitor. 


The Al implementation journey starts with developing an Al 
strategy. This involves building a corporate-level Al strategy, 
aligning the strategy with regulations and industry standards, 
and refining internal policies and practices accordingly. At 
the planning stage, factors such as the impact of Al on a 
bank’s revenue and reputation need to be considered. At the 
ecosystem stage, banks will need to assess their available 
Al talent and purchase external solutions where necessary. 
When developing and deploying Al solutions, banks need 
to follow an agile but well-governed development and 
deployment lifecycle. Lastly, the operation of the Al solution 





will deviate from traditional IT applications because of the 
need to monitor and calibrate it, and the fact that the model 
is governed by a series of explainable, technology risk 
management principles. 


The foundation of Al implementation is governance. To 
establish proper oversight and governance, banks should 
ensure that controls are in place to track performance, risks 
and impacts. 


To help organisations with their AI implementation and to 
mitigate risks, PwC has introduced different tools designed 
to support the measurement and assessment of Al 
implementation from the perspectives of interpretability and 
explainability, robustness and security, and fairness. 
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PwC’s research estimates that Al could contribute $15.7 
trillion to the global economy by 2030, as a result of 
productivity gains and increased consumer demand driven 
by Al-enhanced products and services. Al solutions are 
spreading across industries and impacting everything from 
customer service and sales to back office automation. 
According to our survey, 95% of banks in Hong Kong are 
also considering making Al part of their corporate strategy. 


However, as discussed in section 3.4, the lack of 
explainability in the Al model, the lack of resources for its 
adoption, and legal and compliance challenges are holding 
some banks back from embracing Al. 


Banks need to overcome these barriers and manage risks 
when implementing Al-related technologies in order to 
remain competitive and cost efficient by comparison with 
the new challenger banks that are entering the market 
globally. 


To help organisations identify and address challenges when 
designing and deploying Al applications, PwC has published 
A practical guide to Responsible Artificial Intelligence 
(Al)’®, which outlines a suggested high level end-to-end Al 
implementation journey. 





Operate 
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PwC's Al journey 
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Responsible Al implementation involves a clearly defined 
journey of six major steps: Strategy, Planning, Ecosystem, 
Development, Deployment, and Operate & Monitor. Such a 
framework to guide banks through the journey is important, 
and should address questions such as who is accountable, 
how and when will Al affect the business operation, what 
decisions need to be made and when, what controls need 
to be in place and when, and whether its overall change 
management is aligned to the bank’s business goals. 


Governance is amajor concern for banks when implementing 
Al solutions. In this section, PwC will also propose eight 
major considerations for governance, and offer an illustrative 
governance standard as a reference for banks. 


To address Al implementation challenges it is important to 
have tools and accelerators in place before Al development 
and deployment begin. Tools are needed to assess how 
well challenges are being monitored, to what extent gaps 
in the plan are being identified and resolved, and whether 
the Al solution is providing the benefits expected. The final 
section introduces a Toolkit and explains how banks can 
benefit from it to address these issues. 


Model refinement 
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5.1. Key considerations in defining a 
high-level Al implementation journey 


This section describes a high-level end-to-end Al 
implementation journey from strategy to operation. Banks 
are invited to leverage this to plan how to develop and 
implement their Al strategy, how to build an ecosystem 
or collaborate, and how to develop, deploy, operate and 
monitor their Al solutions. 


5.1.1. Strategy 


The Al implementation journey starts with developing 
an Al strategy. This involves building a corporate-level Al 
strategy, aligning the strategy with regulations and industry 
standards, and refining internal policies and practices. 


Corporate-Level Al Strategy 


- Organisation 
- Process 


- System 
: Talent 


Regulations & Industry Standards 


GDPR 
PDPO 
Fairness 


Transparency 
Accountability 
Ethics 


Internal Policies 


& Practices 


Data 
Talent 
1. Building a corporate-level Al strategy 


Algorithm 
Infrastructure 


In building a corporate-level AI strategy, banks should first 
inform stakeholders at different levels about how AI will 
be used to drive enterprise values and goals within the 
organisation. This can be done by conducting initial market 
research and strategic assessments on common AI use 
cases and their applicability to the banks, and performing 
an organisational assessment to identify major gaps in 
processes, people, systems and data. Banks can then 
guantify the enterprise value of AI by conducting financial 
analysis to determine the ROI for AI, and impact analysis 
on factors like risks, controls and talent. A strategic agenda 
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can then be outlined for both short-term and long-term 
activities, and priorities can be defined as to what needs to 
be done to enable the AI initiatives. 


One key strategic agenda item is to develop an enterprise 
talent and organisational strategy for Al. Banks are 
recommended to assess their internal Al capabilities and 
identify skills gaps regularly as part of their overall Al 
strategy. To mitigate the shortage of Al talent, banks might 
formulate a talent strategy for continually investing in their 
digital talent pipeline, as well as building in-house skills 
through enterprise-wide training programmes. At the same 
time, they are recommended to define a new organisational 
structure that allows for efficient internal collaboration for Al 
development, since banking domain knowledge is needed 
to build up relevant Al models for specific use cases. 


To facilitate communication and collaboration between 
Al specialists and the bank’s business functions, banks 
can consider implementing a new cross-functional 
implementation team structure - DevOps. DevOps blends 
business analysts, developers and system operators in 
one integrated team focussing on one specific process. 
This reduces communication overheads and enables faster 
solution deployment, enabling banks to develop and deliver 
Al applications more quickly. 


2. Aligning strategy with regulations and industry 
standards 


When developing a corporate-level Al strategy, banks 
need to take regulations and industry standards into 
consideration. Banking is a highly regulated industry, and 
regulations relating to fairness, accountability, transparency, 
and ethics may affect the use of Al (this is further discussed 
in section 5.2). To ensure regulatory compliance, banks 
should survey existing and imminent regulations relating to 
Al, interpret them and establish a consensus on their likely 
regulatory impact, and validate their Al strategy relative to 
the regulations with key stakeholders. Banks should also 
survey and consider following existing international and 
local industry standards. It is recommended that the Al 
strategy is validated against the survey results with key 
stakeholders. 


3. Refining internal policies and practices 


After completing steps 1 and 2 above, banks are 
recommended to refine their internal policies and practices 
on data, infrastructure, algorithms, and talent to provide 
clear direction and guidance on the adoption of Al for 
different business units. Banks can create an inventory 
of existing policies and identify those which will impact 
on Al, and then come to a consensus on refining these 
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policies based on their Al strategy, along with regulations 
and industry standards. Taking data policy as an example, 
a bank may store its customer data and transaction data 
across different business functions and in different silo 
systems. However, Al requires large amounts of customer 
data for model training to obtain better customer insights. 
There is thus a need for clear guidelines on a data policy, 
and specifically on the sharing of customer data across 
different business functions. 


The frequency and response time of 
augmented or automated decisions 
that an Al application will make (e.g. 
the bank may consider the number 
of alerts that AlOps will generate in 
one month as one aspect of evaluat- 
ing AlOps) 


The potential risk of unexpected Al 
decisions on business operations, 
and the societal, ethical and work- 
force impacts of the Al application 
(e.g. investment recommendations 
given by robo-advisors may be 


unfair or lead to negative returns) Regulation 


The acceptable use, data protec- 
tion, and minimum level of functional 
validation required by regulation for 
a given Al application. (e.g. PDPO in 
Hong Kong) 





5.1.2. Planning 


At the planning stage, banks should first identify the highest 
value Al opportunities that align with their business strategy, 
and decide the extent to which the Al solution should be 
deployed. Given Al’s heterogeneous nature, there are no 
standardised formulae for evaluating business use cases. 
However, creating a comprehensive user story can help 
banks determine the level of performance needed from an 
Al solution. To clarify whether Al is appropriate for particular 
use cases, PwC proposes the following six factors for 
consideration in Al implementation planning: 


The business and economic impact 

(e.g. total investment, return on 

investment, profit brought) of 
= adopting the Al application 

TP 


Revenue 


The robustness and technical 
limitations of the proposed Al solu- 
tion; the minimum amount of data 
and infrastructure needed to enable 
meaningful predictions; the accura- 
cy and ability of the solution to 
generalise from out-of-sample data 
(e.g. if implementing the Al solution 
requires the purchase of additional 
computation power to ensure 
accuracy and robustness) 


o 
wt 


Reputation 


The impact on a bank’s business 
reputation of the Al application, and 
its reception and potential reac- 
tions by different internal and exter- 
nal stakeholders (e.g. data privacy 
may lead to reputation damage if 
not well managed) 


This assessment of business use case feasibility needs to be conducted collaboratively by technical and business teams 
to ensure all aspects are evaluated and gaps for implementation are identified. 
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5.1.3. Ecosystem 


After the use cases are evaluated and confirmed, banks 
need to assess their current Al talent and decide on an 
implementation model. Developing Al solutions always 
requires specialists, such as data scientists or algorithm 
programmers, who may not always be readily available for 
banks. To ensure the Al model is developed and deployed 
smoothly, a well-defined technology roadmap is essential, 
one which ensures internal and external collaborations are 
properly considered and managed. 


Banks may decide to adopt a mature external Al solution as 
part of their overall Al architecture (e.g. a Cantonese NLP 
engine) in cases where the solution is not data sensitive. In 
cases that require customer data for model training, such as 
robo-advisors, it is preferable for banks to create their own 
Al team to develop the solution. 


During Al implementation, a bank may need to collaborate 
with other banks, or even non-bank parties, to source data 
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to fuel the Al engine. However, it is recommended that 
banks carefully assess the financial viability and the data 
access controls of the vendor before incorporating such 
Al solutions into their infrastructure, given the fast-moving 
vendor landscape. 


It is also important for banks planning to deploy Al to start 
with a cohesive change management plan. Unlike traditional 
IT system implementation, the deployment of Al solutions 
could create uncertainties in the workforce. A robust change 
management framework that caters to the workforce is key 
to the success of any Al planning. 


5.1.4. Development and deployment 


After identifying the project objectives and carrying out 
thorough planning, banks need to follow an agile but 
well-governed development and deployment lifecycle. 
This includes the acquisition and selection of data, model 
training, model evaluation and testing, interpretation, and 
deployment. 


— 7 — 


Acquire & 


Train Al Model 





Select Data 


The first step of the cycle is to define, collect and prepare all 
the relevant data for use in model training. With appropriate 
selection of the data to feed the Al algorithms, the output Al 
model will be more accurate and fairer in its predictions. The 
selected data also needs to be divided into training data and 
testing data for model training and testing respectively. 


The next step is to select an Al algorithm and use the selected 
training data to train the Al model. The Al development toolsets 
in model training are different from traditional models. The latter 
usually consist of a single implementation of each algorithm in 
one commercial package. With multiple open source libraries 
and tools available for any given Al or machine learning 
algorithm, it is common for a modern analytics development 
process to build models relying on several different tools and 
packages which yield different results and performance. If 
there is no integrated model development environment, or at 
least some form of version control, there is a greater potential 
for inconsistency and errors. 


Evaluate and 





Test Al Model 
— a 


Interpret Deploy 





Al Model Al Model 


Once the training is complete, it is time to check that the 
model produces accurate and fair results. This is where the 
data identified as testing data is used. The evaluation allows 
the bank to test the model against data that has not been used 
for training. The result enables the development team to see 
how the model might perform on data that it has not yet seen, 
and thus how it might perform in the real world. 


The final step is to deploy the trained model. Due to the nature 
of machine learning, new Al models are frequently updated 
and deployed. Banks needs a robust handover process 
between the development and operation teams to ensure the 
deployment process does not disrupt existing processes. This 
is typically done by conducting automated continuous testing 
before deployment. 


Once deployed, production data will start to arrive. With more 
real data, user feedback and model adjustment, the model can 
be continuously improved. 
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5.1.5. Operate and monitor 


The operation of the Al solution will deviate from traditional 
IT applications because of the need to monitor and calibrate 
it, and the fact that the model is governed by a series of 
explainable, technology risk management principles. The 
model review cycle should be designed to comprehensively 
address the entire model ecosystem. 


Periodic reviews and ongoing monitoring are recommended 
so that banks can ensure robust execution and manage 
cybersecurity threats. This would include continuous review 
of models, re-validation of data and resources, and review 
of contingency measures such as human intervention or 
the use of conventional processes. The introduction of 
safeguard methods, such as scenario planning, is also 
recommended to prevent unintentional interactions. At the 
operating stage, as the model result will be affected by 
accumulated production data, it is also vital to maintain the 
quality and fairness of input data. 


Below is a case study of how a bank in Hong Kong (code- 
named Omega Bank) used the above Al implementation 
approach to develop a robo-advisor. 
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Omega Bank launches a Robo-Advisor solution to provide existing customers 
with professional investment recommendations 


Omega Bank wants to offer professional investment 
recommendations to ordinary customers without incurring 
a significant increase in operating costs. To achieve this, 
Omega Bank decided to develop a robo-advisor. 


Its robo-advisor implementation journey started with 
strategy planning. Omega Bank began by establishing 
guiding principles on ethical and explainable Al with 
reference to different frameworks, and implemented an 
overall governance process. To validate whether the use 
cases aligned with its strategic objectives and guiding 
principles, Omega Bank engaged both its business and IT 
teams early in use case discussions. It took multiple factors 
into consideration when evaluating the benefits, feasibility 
and limitations of the solution. 


As banking is a highly regulated industry, regulatory 
compliance is critical for emerging technologies like Al. 
Omega Bank therefore held several meetings to discuss 
controls to ensure compliance, and developed governance 
and risk management guidelines following the HKMAs 
guidelines on Online Distribution and Advisory Platforms. 
A data privacy framework was also developed to ensure 
customers’ data would be safely stored and processed in 
the implementation lifecycle, in compliance with the PDPO. 


Talent and partnerships are also critical factors for Al 
development. The bank considered its internal Al capabilities 
and evaluated which Al tools needed to be bought and which 
could be developed in-house. Considering the domain 
knowledge and data available in-house and its ability to 
maintain these, the bank decided to develop an in-house 


machine learning model for investment recommendations, 
while buying an NLP solution to analyse English, Cantonese 
and Mandarin from a local start-up. 


To develop the Al model, the bank first identified what data 
was required, including customer profile (e.g. age, annual 
income, gender, and industry), the customer’s investment 
preference (risk preference, market preference, and 
product preference), and the customer’s targeted return 
on investment, and other useful data. With both data and 
algorithm ready, the bank performed data labelling, model 
training with supervised machine learning, and model 
evaluation. A model with a satisfactory performance that 
outperformed the market index was developed after several 
rounds of testing and enhancement. To better explain the 
recommendations from the robo-advisor, a surrogate model 
was built using a decision tree to explain the key decisions 
of the robo-advisor, based on some key input variables. 


Once the robo-advisor went live, it outperformed the market 
and began to achieve customer traction. However, after six 
months of deployment, the bank performed a regular review 
and found that the robo-advisor had underperformed 
in one month due to an outlier event. The in-house data 
scientists documented the findings and re-calibrated the 
model by changing the selection of input variables. They 
then evaluated the updated model through thorough back- 
testing before redeploying the model. With continuous 
improvement and re-calibration, it is expected that the 
robo-advisor will be able to perform to the standard of a 
professional investment advisor. 


5.2. Key considerations in establishing a 
governance model 


The foundation stone of Responsible Al is end-to-end 
governance. Because decisions made by a Chatbot or 
an ML-driven prediction system are made by machines 
rather than humans, for many banks it can be unclear how 
such decisions should be governed. The board or senior 
management of banks should appreciate that they remain 
accountable for all Al-driven decisions. To ensure regulatory 
compliance, it is critical for banks to establish proper 
governance and risk management frameworks. 


To do this, banks first need to answer such questions as: 
How does Al align with our core business strategies? What 
controls should be in place to track performance? What 
are the risks and impacts that Al will bring to consumers, 
businesses, societies and nations? 


To help banks better build or update their Al governance 
model, PwC has summarised eight aspects that banks 
need to consider: 


[Consideration | Risks Details ———— 


e Compliance risk 
Compliance e Data privacy risk 
e Transparency risk 


e Value alignment risk 


Ethical 
e Global alignment risk 


e Black box risk 
Interpretability & Explainability e Compliance risk 
e Lack of value risk 


e Adversarial attack 
Robustness & Security e API model theft 


e Open source software risk 


e Fairness risk 
Fairness e Reputation risk 
e Lack of value risk 


e Job displacement risk 


e Uncertain or unassessable 


Economic ROI 

e Loss of institutional 

knowledge 
R e Reputational risk 

Societal pu a A 

e Intelligence divide risk 

e Lack of AI talent to detect 
Control 


or control rogue actions 
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A compliance review process should be defined to 
keep track of regulatory changes and ensure the bank’s 
policies and processes are compliant. 


Enterprise-wide definition of Al use cases, data collection 
and model training should be defined based on ethical 
principles 


The decision of the Al model should be interpretable and 
explainable to solidify the decision and avoid a ‘black- 
box’ threat 


Rigorous validation, continuous monitoring and 
maintenance, verification and adversarial modelling 
should be planned to improve the security and 
robustness of Al. Banks should cover all Al systems, 
data, and communications to ensure security. 


Al decisions should be fair and avoid discrimination on 
the basis of age, gender, religion, race, etc. 


Key performance indicators should be established to 
ensure the realisation of benefits, and also to mitigate 
unwanted outcomes that are sometimes overlooked 
when deploying technology changes. 


Consumer data-privacy settings should be examined to 
prevent possible social backlash. Personally identifiable 
information (Pll) data should be securely stored to 
prevent downstream individual implications. 

Effective Al operating models and processes should be 
designed to improve accountability and the quality of 
results. 


Among these considerations, compliance is one of the 
most important when setting up the governance model. 
As the banking sector is highly regulated, banks must 
monitor the regulatory environment in which they operate 
and understand how AI will shape their future business 
practices. As mentioned in section 1.3.1, banks need huge 
amounts of customer data to train the AI model, so data 
protection is a critical consideration. When developing 
Al applications, banks in Hong Kong need to ensure 
compliance with the Personal Data (Privacy) Ordinance 
(PDPO), any relevant codes of practice issued or approved 
by the Privacy Commissioner for Personal Data (PCPD) 
giving practical guidance on compliance with the PDPO, 
and any other applicable local and overseas statutory or 
regulatory reguirements. 
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Banks are also recommended to strive to develop, 
implement, and use AI solutions that are morally responsible 
and ethically defensible. To make ethical principles more 
actionable, banks could contextualise ethical principles in 
the form of specific guidelines for front-line staff. By doing 
this, banks may be able to identify the ethical implications 
of their AI solutions and the relevant principles that should 
be taken into account when designing and implementing Al 
models, better enabling robust mitigation of ethical risks. 


To ensure Al implementation is compliant with major 
regulations and to better visualise the above considerations, 
banks are recommended to establish a governance 
standard across their business units. In this way banks will 
have greater control and confidence throughout the six 
stages of Al implementation. Below is an example of what 
a governance standard across different business units and 
departments could look like: 


Illustrative 





Business Units of a Bank 





see [Branch [CASA [Leading [credit [Treasury [Operations [Risk [Corporate | 
10 10 10 10 10 0 10 


Compliance 10 
Ethical 10 10 10 10 
Interpretability 

1 1 
& Explainability Š ? ï 2 
ele nie & 40 10 10 10 
Security 
Fairness 10 9 10 10 
Economic 8 7 
Societal 9 9 
Control 8 8 





1 


9 10 9 10 
9 8 9 10 
10 10 9 10 
10 9 9 10 
9 8 
10 10 10 10 
8 9 8 9 


Score from 1 to 10. The higher the score, the more the BU should pay attention to the consideration. 
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5.3. Measuring and assessing Al 
implementation 


It is important to accurately measure and assess governance 
issues when implementing an Al solution. A Toolkit is 
introduced to help assess features of Interpretability and 
Explainability, Robustness and Security, and Fairness 
during Al implementation. 


5.3.1. Interpretability and Explainability 


As discussed in section 3.4.1, lack of explainability is a 
major barrier to banks adopting Al. Before deploying Al 
applications, banks need to ensure that decisions made 
by Al are explainable to all relevant stakeholders, including 
customers, regulators and business sponsors. Banks should 
implement adequate measures during the design phase 
to ensure a level of explainability that is appropriate and 
commensurate with the materiality of their Al applications. 


Banks should consider the following when building 
explainable Al: 


1. Confirm the level of explainability: the verification and 
validation process for different Al use cases may require 
different elements of explainability. For example, an Al 
system for targeted advertising requires a relatively low 
level of explainability, as the consequences of it going 
wrong are negligible. On the other hand, the level of 
explainability for an Al-based recommendation system 
(e.g. arobo-advisor) must be significantly higher, because 
any errors could not only lead to investment losses, but 
also damage the reputation of similar systems and even 
of the bank itself. 


2. Start at the design phase: in most engineering processes, 
developers must consider the capabilities of systems 
at the early stages of the design phase. Explainability 
is no different — it needs to be considered up front and 
embedded into the design of the Al application. This 
will affect the choice of AI algorithm, the means of data 
processing, and the type of data to be collected. 


3. Choose the right algorithm: different algorithms have 
different levels of explainability. Decision trees, for 
example, are highly amenable to explanation. It is possible 
to build commercially useful models where the entire 
decision process can be diagrammatically illustrated. 
Neural networks or deep learning, on the other hand, 
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while amenable to graphical analysis, contain many 
more connections and have more subtle properties with 
respect to node interactions that are inherently difficult 
to explain. 


There are two types of interpretability: global and local 
interpretability. Global interpretability means that the 
developer can comprehend the entire model at once. This 
level of interpretability is about understanding how the 
model makes decisions, based on a holistic view of its 
features and each of the data inputs, such as data type 
and data weight. Local interpretability, on the other hand, 
means the conditional interaction between dependent 
and independent variables for a specific observation. The 
tools described below can help banks in measuring and 
assessing both global and local interpretability: 


1. Global interpretability - surrogate modelling tool: this 
approach involves building another model alongside the 
trained model using a more explainable algorithm, e.g. 
a decision tree. It can mimic the output of the trained 
black box model and help data scientists to understand 
approximately how the black box model works. This 
model reveals which variables are most important, and 
how they affect the predictions. 


2. Global interpretability - partial dependence plot: this 
approach evaluates how variables affect the prediction 
by simulating all the combinations of input variables. It 
provides a visualised diagram of the relationship between 
variables and prediction. 


3. Local interpretability measurement tool: this tool 
quantifies the importance of a selective input variable by 
testing the sensitivity of the model’s result to the value 
change of that input variable. This tool measures and 
sorts the importance of input variables, and thus provides 
a better understanding of how the model works. 


5.3.2. Robustness and security 


Robustness and security are two other vital factors in 
successful Al implementation. Al solutions need to be 
robust and secure to be effective and reliable. Banks are 
encouraged to implement effective cybersecurity measures 
to avoid or control potential cybersecurity threats such as 
data poisoning and adversarial attacks. 


The following tools can help banks measure and assess 
robustness and security: 


1. Model Stability Assessment Tool: this tool assists users in 
assessing the prediction stability and sensitivity of trained 
models. Banks can use this tool to check if a trained 
model would generate the same outputs for identical 
inputs and generate similar outputs for statistically- 
similar inputs after model updates. 


2. API Theft Assessment Tool: this tool attempts to perform 
model extraction attacks on a ML model to see whether 
it is possible to replicate the model using an open API. As 
ML models are considered important IP, banks can use 
this tool to test if their models could be subject to cyber 
theft, and enhance their security accordingly. 


3. Synthetic Data Generator: this tool learns from existing 
data and generates statistically similar data while 
maintaining data complexity. If banks cannot obtain 
enough training or testing data, this tool can help provide 
data for model testing or training. 


5.3.3. Fairness 


In all stages of the Al implementation life cycle, bias can 
be introduced. This could lead to unfair predictions based 
on race, gender, religion or other factors. Bias can cause 
decisions made by the model to lead to unfair or inappropriate 
risk levels or product offers, or even to contravention of anti- 
discrimination laws. Banks should ensure that Al-driven 
decisions do not discriminate or unintentionally show bias 
against any group of consumers. In addition, banks should 
ensure that Al models produce objective, consistent, ethical 
and fair outcomes to customers. 


To ensure fairness in the Al model, banks need to formalise 
enterprise-wide governance and consistent quality control 
policies and processes for evaluating data sources, 
understanding how the data is organised, and testing it 
to ensure that it is representative and that it achieves fair 
outcomes for customers. There need to be procedures to 
detect whether the training data is sufficient and the data 
quality is good enough to build a robust and fair Al model. 
Other than evaluating the data, banks are also recommended 
to mitigate bias by leveraging tools and techniques that can 
improve the transparency and intelligibility of models. 


While they is no universal definition of fairness (there 
are more than twenty definitions of fairness, according 
to different regulations and organisations” ), it is 
recommended that banks establish company policies 
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and select approaches that will Support the fairness 
assessment according to their own definitions and 
requirements. The following bias detection tools can help 
banks to check whether a model discriminates against 
various groups or individuals given a test dataset, as well as 
to correct biased outcomes for various groups or individuals 
(e.g. age groups): 


1. Bias Detection — Comparing Performance Metrics: 
this tool analyses performance of the model across 
different data groups, by calculating the rates of correct/ 
incorrect predictions. Banks can use this tool to compute 
performance metrics for each possible group in the test 
dataset. 


2. Fairness Detection Tool — Measuring Fairness: this 
tool compares the outputs of disadvantaged versus 
advantaged groups using fairness definitions, e.g. equal 
opportunity or Gini equality. Banks can use this tool to 
quantify discrimination against disadvantaged groups 
and solve unfairness issues. 


3. Bias Intervention Tool — Thresholding: this tool computes 
the decision boundaries of fairness for discriminated 
groups by re-adjusting thresholds until disparities 
between the different groups are minimised. Banks can 
use this tool to refine the model to reduce bias. 
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Al for Hong Kong banking: 
looking forward 


Key Takeaways 


We here summarise four recommendations for stakeholders 
in the banking sector to consider in support of the industrial 
development of Al: building an Al-ready workforce; 
establishing an AI Centre of Excellence (CoE); fostering a 
Hong Kong banking AI community and infrastructure; and 
drafting risk guidelines. 


To better prepare for the current and next wave of AI, 
banks should build up an Al-ready workforce. Options 
include reskilling existing employees, and encouraging 
employees to participate in professional training or Al- 
related co-university programmes. An Al CoE is defined 





as a team within a company that provides leadership, best 
practice, research, support and/or training in a given area. 
Establishing an Al CoE to manage the bank’s Al investments 
and project portfolio will help banks refine their operating 
models. Fostering Hong Kong’s banking Al community 
and infrastructure can help support banks in adopting and 
applying Al technologies. Lastly, drafting guidelines on Al 
risk management in consultation with working groups of 
banks and regulators would also help to better manage 
regulatory compliance. 
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To facilitate Al development in Hong Kong’s banking 
industry, having a comprehensive implementation and 
governance framework such as that provided in Chapter 5 
is only the first step. The ultimate goal is to develop Hong 
Kong into an Al Innovation Hub which attracts talent and 
investment from around the globe so as to maintain the city’s 
competitiveness in the digital era. In pursuit of this goal, we 
present four recommendations in support of the industrial 
development of Al for banks and other stakeholders (Such 
as Al technology vendors and innovation incubators in the 
banking ecosystem). 


These recommendations aim at tackling challenges and 
risks arising from Al adoption from the perspectives of 
internal preparation, external collaboration and regulatory 
guidelines. They include building an Al-ready workforce, 
establishing an Al Centre of Excellence, developing a Hong 
Kong banking Al community, and managing Al technology 
risk. 


6.1. Building an Al-ready workforce 


According to our survey, 75% of the respondent banks 
which have considered developing Al solutions say they 
would prefer to manage the research and development 
stage internally rather than outsourcing it. Some state 
that this is mainly because banking domain knowledge is 


Citizen Al users 
get expert help when needed. 


Citizen Al 
developers 





Providing industrial training 


Besides internal reskilling, the banking sector should also 
support industry-wide professional training. 


Hong Kong has a long history of industry-wide organisations 
(such as The Hong Kong Association of Banks (HKAB) and 
other professional education providers) offering effective 
industry or subject-specific training to practitioners in the 
banking industry. 


An Al curriculum could be added to existing programmes 
and these organisations could leverage their platforms 
to accelerate industry-wide upskilling. In our interviews 
with banks, many suggested that government funding 
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necessary to build a suitable model for specific business 
use cases. This indicates that having an internal workforce 
ready for Al adoption is a key factor in determining whether 
a bank can successfully adopt Al solutions to address its 
business problems. However, as mentioned in section 3.4.2, 
the shortage of appropriate talent is a major challenge faced 
by banks. 


The following four recommendations are made with the aim 
of helping banks get their existing workforce Al-ready and 
increase their future talent supply. 


Reskilling existing employees 


Reskilling existing employees is a sustainable way for banks 
to tackle the problem of insufficient Al talent in Hong Kong, 
by transforming traditional banking professionals into Al- 
ready professionals. Such employees can provide a solid 
foundation for long-term Al development within the bank. 


As an international financial centre, Hong Kong enjoys the 
unique advantage of having a large portfolio of banking 
talent in multiple disciplines and with diverse backgrounds 
and nationalities. It also has the breadth and depth in 
terms of banking services offerings to attract the required 
talent. One recommendation is therefore for Hong Kong 
to upgrade its existing banking workforce and to define a 
talent strategy to nurture and upskill non-Al professionals to 
become ‘citizen Al users’ and ‘citizen Al developers’. 


The majority of staff, including front line and operations, should be trained to be able to use the 
bank’s Al-enhanced applications. They should also be able to support good data governance, and 


5%-10% of business unit professionals who are power users should be trained to identify use cases 
and quality data sets, and work closely with Al specialists in developing new Al applications. 


(e.g. extending the coverage of Hong Kong’s Continuing 
Education Fund) could also be provided to subsidise training 
in order for Hong Kong’s banking sector to rise above the 
Al baseline. 


Merely upskilling the existing workforce will not overcome 
the Al talent shortage. Some interviewed banks noted that it 
is very difficult and costly to upgrade banking professionals 
to become advanced data scientists and data engineers. 
However, data scientists possessing advanced analytical 
skills, and data engineers who can create, deploy, and 
manage Al applications, are key to elevating Hong Kong’s 
position in Al. Solving the root causes of this shortage 
through education and talent transfer programmes is the 
only effective route. 


Co-University programmes 


The statistics given below suggest that one of the reasons 
for the shortage of experts is the mismatch between the 
supply of students and demand from employers. 


Currently, limited numbers of university programmes are 
heavily FinTech focused. According to the Hong Kong Joint 
University Programmes Admissions System (JUPAS) and 
local university websites, there were 29 FinTech related 
programmes in Hong Kong in 2019 (ten Bachelor's and 
nineteen Master's degree programmes) (see Appendix C). 


However, a significant talent gap still exists within the 
industry because not enough graduates possess the 
relevant technical skills and capabilities. The 2019 survey 
and dialogue “Hiring in FinTech” by Michael Page® revealed 
that 6496 of employers found the recruitment process for 
FinTech talent challenging, while 4796 agreed that the 
key challenge was a shortage of proven skills for FinTech 
related roles. Moreover, 80.7% of respondents in our survey 
recognised ‘Hard to find employees with Al expertise’ as a 
significant barrier for Al adoption. The talent that is available 
lacks the exposure and academic training for both emerging 
technologies and banking operations. 


Obviously, the supply and quality of talent in Hong Kong is 
inadequate to support the growing demand for technology 
specialists by established institutions and virtual banks. 
A lack of data scientists and computer scientists with 
domain-specific knowledge in banking is hindering the 
overall development of the sector. 


Some of the banks we interviewed mentioned that culture 
and lack of career support could be reasons for this 
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shortage. While Hong Kong’s universities are well known 
around the world, few overseas students are willing to 
pursue technology-related careers in Hong Kong. 


To mitigate this problem and nurture the talent required to 
develop Al, the co-development of Al-related programmes 
by financial institutions and universities is recommended. 
In Hong Kong, the HKMA launched the FinTech Career 
Accelerator Scheme (FCAS) in 2016 in collaboration with 
the Hong Kong Applied Science and Technology Research 
Institute (ASTRI) to meet the growing need for FinTech talent 
in Hong Kong®". One of the programmes in FCAS is the Gap 
Year Replacement, which allows students from participating 
tertiary education institutions to work on FinTech-related 
projects full-time at banks or the HKMA for six months or 
one year. Banks could consider joining this programme or 
developing their own co-university programme (e.g. a one 
year internship) to expand the FinTech talent pool. With early 
exposure to real industry experience, students can build 
both banking and Al-specific Knowledge, thus reducing the 
talent gap. 


Talent/Capability transfer programmes 


Although Hong Kong is well known for the quality of its 
academic institutions, it has started losing out in the 
competition for the global talent pool, according to a report 
issued by IMD World Talent Ranking in 2018®, falling from 
12th to 18th. 


To increase its competitiveness and relieve the pressure 
of this talent shortage, a wider market for talent should 
be considered. Programmes to import technology talent 
should be launched and accelerated. For example: 


A three-year pilot scheme which provides a fast-track arrangement for eligible technology 
companies or institutes to admit overseas and Mainland technology talent to undertake research 
and development work. 


TechTAS 


The HK Innovation and Technology Commission (ITC)® established an AIR@InnoHK® programme in 
the second half of this year to support research into AI and robotics technologies and its application 


AIR@InnoHK a : f F ; i : 
in areas such as financial services. The research may cover big data analytics, machine learning, 





Besides relying on external resources, one recommendation 
is to launch a programme to send locals overseas to learn 
from more developed economies. Hong Kong could thus 
leverage the successful experience of other markets. 
For example, Singapore launched a ‘Capability Transfer 


cognitive systems, intelligent agents, etc. 


Programme’ in 201755 which aims to transfer capabilities 
from foreign specialists to locals. The programme provides 
a 90% (up to SG$300,000) allowance for sending eligible Al 
staff overseas for training, or for hiring overseas experts to 
train local staff. 





80 MichaelPage. Hong Kong: Hiring in Fintech Survey and Dialogue. https://www.michaelpage.com.hk/sites/michaelpage.com.hk/files/16785-hk 





fintech_brochure_mp.v8.pdf 





" FCAS (2019). FinTech career accelerator scheme (Gap year placement 
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6.2. Establishing an Al centre of 
excellence (CoE) 


Whichever stage banks are at on their Al journey, they 
will need to continuously discover ways to enhance their 
business using Al in order to justify their investment. Rather 
than just relying on the resources of their technology 
departments, banks should advocate Al adoption across 
the business to adopt the Al solution more quickly and with 
higher quality results. To achieve this, they should consider 
establishing an Al Centre of Excellence (CoE). 


A CoE is defined as a team within a company that provides 
leadership, best practice, research, support and/or training 
in a given area. For Al development, a CoE could act as a 
centre which collects required resources for Al development, 
such as use cases and initiatives. It could also act as a 
centralised knowledge base which provides suggestions 
and recommendations on the Al technology to be chosen 
and diffused throughout workflows and processes®. 


Internal 
Facilitation 
front line staff. 


different projects and initiatives. 


Model 
Development 


Initiative 
Management 


Ethical 
Management 


algorithmic bias and other factors. 





For banks that are exploring the possibilities of Al, an Al CoE 
can help to accelerate the development and adoption of Al 
within the organisation. Establishing an Al CoE to manage 
the bank’s Al investments and project portfolio will refine its 
operating model by allowing teams from the business and 
the IT department, as well as Al specialists, to collaborate in 
solving specific business problems. More re-usable models 
and Al components will be created as a result, and the ROI 
for investing in Al projects will become easier to calculate 
over time by referencing project successes and lessons 
learned. 


At the same time, given the ethical issues that can arise 
from Al, an Al CoE should also emphasise the provision 
of ethical guidance. By establishing a centralised ethics- 
related position or review board, banks should be able 
to more easily align the Al technology with their broader 
ethical, governance and privacy regulations. 


It is suggested that the Al CoE should be capable of 
facilitating the following: 


Facilitating Al development by increasing user acceptance: 


e Al Education Programme: To develop and execute enterprise-wide educational programmes on 
key Al tools, platforms, use cases and regulations, covering every level from board members to 


Building up a solid foundation and infrastructure for model development: 
e Reusable Digital Assets: To create inventories and advocate Al digital assets be developed in 


e Implementation Support: To provide advanced AI specialist resources to support Al development. 
Ideally the Al CoE should create and manage a one-stop digital platform for collaboration, support 
and resource management, representing a system environment with pluggable tools, shared data 
sets, and documentation on reusable components, methodologies and lessons learned. 


Helping manage and monitor all the Al initiatives, providing a clear direction in strategy setting, and 

serving as a centralised knowledge centre for Al implementation: 

e Use Case Sharing: To develop and share internal use cases of Al, and to research and introduce 
Al use cases implemented by peer banks to the organisation. 


e Al Project Portfolio Management: To maintain the Al project portfolio, including ROI tracking and 
to create Al project prioritisation metrics according to the bank’s corporate strategy. 


Guiding the business and development team in developing ethical Al solutions: 


e Ethical guidelines: To build comprehensive ethical guidelines according to governance, regulations 
and industry standards for Al solution development. 


e Ethics check: To help test if the Al solution is ethical based on fairness, impact on customers, 


86 Ramel, D. (17 January 2019). How to create an enterprise “Al Centre of Excellence”. https://pureai.com/articles/2019/01/17/ai-coe.aspx?m=1 
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6.3. Fostering Hong Kong's banking Al 
community and infrastructure 


Throughout this research, a recurring challenge cited by 
banks has been the lack of Al use cases available to help 
them understand how to start adopting Al. A failure to identify 
beneficial business use cases brings challenges when 
justifying investment in Al, thus hindering its development. 
Setting up a CoE is one solution, as discussed in the 
previous section, while another is collaboration between 
banks. By understanding the state of play across the 
industry and leveraging others’ experience, slower market 
followers can catch up. 


To promote and support industry-wide collaboration, a 
neutral forum should be established to encourage banks 
in Hong Kong to adopt and apply Al technologies. The 
objectives of the forum would include: 


e To facilitate the discussion and identification of use 
cases, lessons learned, and challenges faced by banks 
when implementing Al. 


e To evaluate open source and other commercial solutions 
in terms of their suitability for banks and their compliance 
with banking regulations. 


e To solicit and coordinate an industry response to relevant 
regulatory and industry consultations on Al. 


e To co-create a digital framework on advanced 
technologies (e.g. federated learning). 


Apart from including all authorised institutions in Hong Kong, 
certain other stakeholders should be involved in the forum 
so as to foster a more comprehensive banking ecosystem. 
For example, involving research organisations (Such as local 
universities and ASTRI) and technology companies would 
help accelerate discussions on how banks could utilise the 
newest Al technologies and develop innovative use cases. 


To achieve its objectives and generate new insights, 
the forum should host regular meetings to discuss both 
technical and non-technical issues. The forum should 
also serve as a knowledge centre. A document library 
could be established containing documented use cases 
and technical specifications. At the same time, quarterly 
reports could be generated to update participants on Al and 
financial developments. Besides discussion and reference, 
the forum could develop a sandbox infrastructure where 
talent nominated by member banks could co-create digital 
assets or Al prototypes. The findings from these pilots could 
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then be shared with other forum participants. 


At the same time, the Al technology providers we interviewed 
also found it challenging to showcase their solutions in a 
banking context without access to actual banking data and 
systems. The need for a general purpose Al infrastructure 
that would enable banks to start their Al experiments and 
journeys is clear. The sandbox-like infrastructure could 
provide a simulated banking environment for the technology 
vendors. This would also help banks better understand and 
evaluate their Al solutions. Programmes offered by the Hong 
Kong Science and Technology Park and other organisations 
may help, but the group needs to be able to formulate a 
plan to utilise public resources in order to advocate the Al 
agenda in banking. 
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6.4. Drafting guidelines on risk 
management in Al 


Another challenge for banks when pushing forward their 
Al agenda is uncertainty over regulatory risk. This has led 
to banks taking a conservative approach to Al deployment 
due to worries over non-compliance after significant 
investment. The deployment of Al technology impacts a 
number of existing practices that are subject to regulation. 


In light of this, the HKMA’s High-level Principles on Artificial 
Intelligence circular? provides guidance to the banking 
sector on the use of Al applications. The twelve principles 
listed in the document cover governance, application 
design and development, and on-going monitoring and 
maintenance for banks to apply in proportion to the nature 
of their Al applications and the level of risk involved. In 


addition, the HKMA’s Consumer Protection in respect of 
Use of Big Data Analytics and Artificial Intelligence (BDAI) by 
Authorized Institutions circular?** recommends attention in 
four major areas, namely (i) governance and accountability, 
(ii) fairness, (iii) transparency and disclosure, and (iv) data 
privacy and protection. 


To further facilitate communication on risk management, 
the HKMA launched its FinTech Supervisory Chatroom in 
2017% with the aim of providing feedback to authorised 
institutions on their technology initiatives at an earlier stage 
and with a shorter response time (i.e. within seven working 
days). 


With greater guidance, banks will become more confident 
and readier to apply Al solutions. The following are some 
topics that may benefit from discussion in the chatroom: 


Use of Production Data 


Guidance on the controls regarding using production data for Al development and the 


sharing of production data when collaborating with third party Al technology providers. 


Human decision controls on Al 
models 


Expectations on the degree of human controls in place, and particularly on the degree 
of straight-through processing by the Al model that is allowed. 


A cloud platform provides flexibility and scalability in terms of the storage and 


Deploying cloud-based Al 
solutions/models 


computation power required for Al development. Many third-party Al solutions are 
available on cloud platforms only. Banks would appreciate greater clarity on the 


permissible scope of cloud deployment. 


As with concerns over Open Banking APIs, clarity on controls and safeguards for 


TSP and outsourcing 
arrangements 


straight-through processing and outsourcing arrangements (e.g. tagging of data for 
model learning) will help banks to design appropriate ecosystems to deploy Al. 





87 HKMA (28 November 2017). Fintech Supervisory Chatroom. https://www.hkma.gov.hk/eng/news-and-media/press-releases/201 7/1 1/20171128-4/ 
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Conclusion 


Artificial Intelligence is a term that has been in use for over 
half a century. Despite its familiarity, for much of its lifetime 
this nascent technology has failed to live up to its billing. 
While computing power grew exponentially, advances in AI 
were modest and incremental. 


However, as this paper has outlined, a number of critical 
factors came into alignment in recent years and transformed 
something long anticipated into a reality. The consequent 
pace of technological progress has been astonishing, as 
anyone who has used freely available online translation 
software in the last couple of years will have noticed. 


What brought this about? First, there was the transition 
from rule-based to non-rule-based learning, then the 
development of complex algorithms and critically the 
'fuel” of vast amounts of data to facilitate machine learning. 
As a result, the basic building blocks, components and 
technigues enabling AI to support banking and other 
industries are now in place. 


However, the ongoing technical difficulties of harnessing 
Al are now combined with other challenges. These include 
user acceptability, finding and retaining expert talent, 
integrating newly-enabled products and services into a 
well-established business strategy, and so on. One of the 
most critical issues for financial institutions and any other 
organisation built on trust is the concept of ‘explainability’. 
Encouraging familiarity with, and eventually confidence in, 
‘black box’ processes is a daunting challenge from both a 





risk management and a customer experience perspective. 


Despite this, the potential to dramatically improve customer 
experience, reduce costs and better manage risk, is 
something no bank can ignore. The drivers for Al adoption 
are just too compelling. Further, the research cited in this 
paper shows that there is also a high degree of ‘market 
pull’, as customers demand more personalised and relevant 
tools to manage their financial and other goals. 


To fully unleash the potential of AI in the banking industry, 
best practice needs to be leveraged and learned from. 
Hong Kong can develop into a leading AI hub by drawing 
on some of its core strengths, such as its deep and long- 
established expertise in banking and professional services. 
However, other aspects of the city’s human capital will also 
need to be intensively nurtured. 


Few players in Hong Kong’s banking industry would claim 
to have a fully-fledged AI strategy or a thorough, all-round 
understanding of how the regulatory landscape will change 
because of Al. For these and other reasons, knowledge 
sharing of tried and tested use cases across Hong Kong’s 
banking Al community is essential. This should encompass 
success stories, as well as stories of those that fell short, 
either technically or due to a lack of real demand. In this way 
Hong Kong's financial institutions will be able to develop an 
industry-wide understanding of the rules of the road for this 
new technology. 
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Appendix A: Glossary of Terms 


and Definitions 
lili [Detinition 


4G is the fourth generation of broadband cellular network technology. A 4G system 
4G provides capabilities as defined by the International Telecommunication Union 
(ITU) in its IMT Advanced Standard. 


5G is the fifth generation of cellular network technology. It operates on three 





5G i 

spectrum bands that provide faster download and upload speeds. 

AlOps stands for artificial intelligence for IT operations. It refers to the multi- 
AlOps layered technology platforms that automate and enhance IT operations. It utilises 


big data, machine learning and other advanced analytics technologies to enhance 
IT operation functions with proactive and dynamic insights. 


Alternative data refers to some non-traditional financial transactions and behaviours 
used by banks to gain additional insights into a company or investment. 


An application programming interface (API) is an interface or communication 
protocol for programmers to build software applications. It has been described as 
a “contract” between applications, ensuring that one application will always get a 
response in a specific format from the other after initiating a defined action. 


Artificial intelligence (Al) is the intelligence demonstrated by machines when a 
Artificial Intelligence machine mimics the cognitive capabilities of the human mind, such as learning, 
reasoning and recognising. 


Big data means the expansion of data in terms of volume (size of the data), 
Big Data variety (type and nature of the data) and velocity (speed of data generation and 
processing). 


A Chatbot is an artificial intelligence (Al) software that simulates conversation 
via text or speech. It is commonly used in dialogue systems for various practical 


Chatbot purposes, including customer service or information acquisition. Some Chatbots 
use sophisticated natural language processing (NLP) systems to process 
conversations. 

A Algorithm to predict the which one of the finite number of categories the model 

Classification 


target belongs to. 


Cloud Computing is the use of a network of remote servers hosted on the internet 
to store, manage and process data. 


A clustering algorithm divides data into groups (a.k.a. clusters) with similar traits 
and features. 


Computer vision is a field of artificial intelligence that trains computers to interpret 
Computer Vision and understand the visual world. It trains machines to accurately identify objects 
and respond accordingly. 
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Decision Tree Classification 


Decision tree learning trains the machine learning model to make a prediction or 
classification decision based on a sequence of tests. The decision tree start with 
a test of the inputs. Each test can have multiple outcomes, and these outcomes 
can lead to new tests (represented as new branches on the decision tree) or to 
conclusions about the target variables (represented as leaves on the decision tree). 


Deep learning is a subset of machine learning that is capable of recognising and 
learning subtle structural patterns from vast amounts of unstructured data. It 
mainly relies on a neural network (NN) to ‘learn’ the patterns from billions of input 
parameters by modelling hundreds of thousands of deep neural network hidden 
layers. 


DevOps is a set of practices that combines software development and IT 
operations. It automates the processes between software development and IT 
teams to shorten the development life cycle and provide continuous delivery with 
high software quality. 


Disaster recovery (DR) is a process to recover an organisation’s operations with 
the aim of minimising the impact of an event on the organisation’s IT infrastructure. 
DR enables the organisation to maintain business continuity or quickly resume 
mission-critical functions following a disaster. 


Edge computing is a system architecture that moves part or all of a computing 
process running from a centralised server to a local device (i.e. edge), such as 
a user’s computer or mobile device. It minimises the amount of long-distance 
communication that has to happen between a client and server, hence reducing 
total network bandwidth and process running time. 


Emotion detection is the process of identifying human emotion, usually from facial 
and verbal expressions. 


Explainable Al (XAI) refers to the methods and techniques that help to make the 
results of a solution understandable to human experts, addressing the issue of 
black box decisions. 


A case in which a model mistakenly predicts a negative. For example, an AML 
prediction model infers that a transaction is not money laundering when it actually 


A case in which a model mistakenly predicts a positive. For example, an AML 
prediction model infers that a transaction is money laundering when it actually is 


Federated learning is a specific category of distributed machine learning 
approaches which trains machine learning models using decentralized data 
residing on end devices such as mobile phones. 


Handwriting Character Recognition (HCR) receives and interprets intelligible 
handwritten inputs. It involves the automatic conversion of text in an image into 
letter codes which are usable within computer and text-processing applications. 


Intelligent process automation (IPA) is the collection of technologies that combines 
process automation with Robotic Process Automation (RPA) and Machine learning 
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The Internet of Things (loT) is a category of technologies that connect devices 
and enable them to exchange data over a network. The ‘things’ include mobile 
phones, smart electrical appliances, wearable devices, and sensors embedded 
into mechanical or electronic components of machines. 


The degree to which the model is understandable by a human user. This is usually 


Internet of Things 


Int tabilit 
Mere roth measured by the model's explainability, transparency, and provability. 

A label is the prediction target—the y variable in simple linear regression. The label 
Label could be a continuous variable (e.g. stock price) or discrete categories (e.g. fraud 


Linear regression is a simple statistical method to model the direct proportional 
(i.e. linear) relationships between independent input variables and the dependent 
output variables. The model can be trained to predict the output variables’ values 
by inferring the linear relationships 


Logistic regression is a variant of linear regression for classification tasks, e.g. which 
Logistic Regression output variable is categorical (e.g. sunny, rainy, cloudy) rather than continuous 
(e.g. degree in temperature) 


Long Short-Term Memory neural network (LSTM) is a recurrent neural network 
(RNN) architecture that is capable of handling sequential relationships within 
unstructured data. In contrast to simple RNN, which usually predicts using short 
term memory, LSTM has a mechanism to define and refresh (add and forget) a 
longer term memory. 


Algorithm to help machines learn relationships from training data and infer rules 
for prediction. 


Narrow Al is a machine intelligence that allows a machine to apply a certain level 
of cognitive capabilities within a specific context or problem. 


Natural Language Generation (NLG) is an Al technique that uses computational 
linguistics to create human-like textual content from structured data. 


Natural language processing (NLP) is a group of Al techniques used to read, 
recognise, process, analyse and understand human languages so that computer 
algorithms can enable human-computer interactions using natural language, 
including language translation, sentiment analysis and interactive conversation 


A neural network (NN) is a machine learning model with multiple hidden layers 
of interconnected prediction units inspired by the interconnected neurons in the 
human brain. Based on the number of hidden layers, NN can be further classified 
into a shallow neural network (SNN) or a deep neural network (DNN). DNN can build 
model using huge (e.g. millions) of input variables and represent the hierarchical 
patterns of inputs at various levels of abstraction and semantic concepts. 


Optical character recognition (OCR) is a technique to convert handwritten or 
Optical Character Recognition printed texts on image files (such as a scanned document or photo) into machine- 
readable text. 


An outlier is an observation point that is distant from other observations in the data 
having similar input variables. 


Personal Identifiable Information (Pll) is any information that can be potentially 
Personally Identifiable Information used to distinguish one person from another and can be used for de-anonymizing 
anonymous data. 


Long Short-Term Memory Neural 
Network 


Neural Network 
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Power User 


Reward 


A power user refers to the user of computer hardware, software or a related device 
who is capable of using its advanced features. 


Reinforcement learning (RL) is a machine learning model that trains machine 
agents to reinforce actions taken under specific conditions so as to maximise 
cumulative rewards 


In reinforcement learning, a Reward (or reward function) is a mathematical 
function defined by data scientists to score and feedback on the machine agents' 
behaviour. The preferred machine behaviours are predefined by humans in the 
reward function. It gives rewards to machine agents when their behaviours are 
aligned with the reward function, and penalises machine agents otherwise. The 
aim of the reward function is to reinforce the behaviour of the machine agent under 
specific conditions through continuous feedback. 


A robo-advisor is a financial advisor platform that provides automated, algorithm- 
driven financial planning. 


Robotic Process Automation (RPA) is the use of software that acts as a metaphorical 
software robot or Al worker to handle high-volume, repetitive tasks. 


The ability of the model to withstand or overcome adverse conditions and maintain 
consistent performance and accuracy of predictions. 


Semi-supervised learning is a class of machine learning that makes use of a 
training dataset combining a small amount of labelled data with a large amount of 
unlabelled data, through bootstrapping the labels. It is a combination of supervised 
learning and unsupervised learning. 


Sentiment analysis refers to identifying, extracting, and qualifying affective states 
and subjective information using natural language processing, text analysis, 
computational linguistics and biometrics. 


Supervised learning is a type of machine learning that learns the relationship of 
inputs and outputs and infer the rules to predict the output given a set of inputs. 
The model is trained using labelled training data consisting of a set of input-output 


Synthetic data is artificially generated data which are not obtained directly by 
observing real-world events or actual measurement. Synthetic data is created 
algorithmically to closely mimic the characteristics of production data but is 
anonymised so that personally identifiable information are excluded. 


A test dataset refers to datasets for assessing the performance of a trained 
model. It is independent of the training dataset, but follows the same probability 
distribution. 


Training datasets are datasets of examples used for learning which fit the 
parameters of the model. 


Transfer Learning is a class of collaborative machine learning that transfers a 
model trained for a specific task and corresponding parameters and uses them 
as the starting point for model training on a second related task, so that model 
training on the same set of data does not have to be repeated 


Unsupervised Learning is a type of machine learning that learns from non-labelled 
and non-categorised training data. It aims to identify commonalities of features in 
the data. 
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Appendix B: Survey results 


The purpose of the questionnaire was to collect high-level information from selected Authorized Institutions under the 
HKMA about the status of Al in the banking sector in Hong Kong. 


The survey contained seven sections: ‘Al Use Cases’, ‘Resources investment’, ‘Incentives and benefits, ‘Barriers to Al 
adoption’, ‘Risks and accountability’, ‘Ethical issues’, and ‘Regulations related to Al adoption’. 


The team has analysed and summarised the data in the survey and generated the following insights for retail banking as 
input for this research paper. 


J Retail Banking Industry 


i 


To what extent are banks in Hong Kong adopting Al 
technology? 


89% 





E Have plan E No plan 


Current 
e Anti-Money Laundering (AML) 


e Cyber security applications 
e Operational automation 
e Fraud detection 


e Know-Your-Customer (KYC) 


Near Future 
2 What are the Al use cases? 
e Client-facing Chatbots 


e Biometric customer identification 


e Remote onboarding 


Exploration Stage 
e Report generation using NLG 


e Sentiment analysis 


* Credit scoring 
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How much capital will you invest in Al in the next 5 Total capital investment in Al will increase 











3 
years? by 70% 
4 Are banks expanding their Al workforce in the next 5 
years? 
Are banks developing Al capabilities through 
5 partnering with external technology firms for Al 
implementation? 
E Yes WNO 
Are banks developing Al capabilities through 
6 managing the research and development stage 
internally? 
82% 
E Yes E No 
7 Are banks with Al adoption utilising Al to shape their 


corporate strategy? 





E Yes E No 
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What are the reasons for banks to apply Al 
technology? 


Improving customer experience 


Strengthening risk management 


Reducing costs 


What are the barriers affecting banks' adoption 
of Al? 


Lack if employees 
with Al expertise 


Insufficient data 


Design ethics of Al 


48% 








Data privacy & security Ea 


Legal compliance 
challenges 


10 


What are the major Al risks identified by banks? 


Lack of expertise or 
resources 


Biased decisions of 
the Al models 


Lack of quality data 


The team also created heatmaps for some of the questions to generate better insights from the results. 
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Barriers to Al adoption (In Section 3) 


Our business Customers do not 
processes require trust Al 
Customer face-to-face interaction 
related barriers with ae 










Costs in resolving Low government- 
f customer doubt in provided incentives 
Environment- Al solutions 
related barriers 
Making needed Changing corporate Likelihood of Insufficient 
N operational changes culture is difficult success is low support from top 
Organization- is difficult management 


related barriers 


Obsolete laws 
prevent financial 


Policy-related innovation based 








barriers on Al 
Have difficulties in There is no business The computing 
a analytical technigues context to apply Al ower is not 
Al readiness- vi : ree A MO 
related barriers 
Ar Results of Al Al cannot ensure Results of Al Directory 
Al application applications are not data security applications are Explainabil 
robust biased m Explainability 
result-related m Resources 
Our institution has Our institution has Our institution The CIO or a similar CIO does not have a 
not established a not established does not have a role cannot clearly direct reporting line 
formal prioritization formal processes to steering committee articulate a vision to the CEO 
k process for Al govern and manage at executive or for Al’s role in our 
Al govemance investments and Al projects senior management institution 
related barriers projects level responsible 
for determining Al 
development 








This heat map has been created based on banks’ responses to the extent of influence of 27 barriers to Al adoption 
identified in the survey. Five extent options were given to the respondents, namely, ‘very serious’, ‘serious’, ‘moderate’, 
‘minor’, and ‘ignorable’. 


The score of each barrier has been calculated by the weighted average of each option with their corresponding multipliers. 
The higher the score, the more the barrier is affecting Al adoption within the banking industry. The following is the formula 
used to calculate the score. 


[Murtipier — 
Ee ; 
C < 
C 2 
mo —— f 
C o 


(Count of ‘very serious’x4)+(Count of ‘serious’x3)+(Count of ‘moderate’ x2)+(Count of ‘minor’x1)+(Count of ‘ignorable’x0) 





Score= 


Total number of counts 
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Al use cases adoption (In section 4) 


Significant 
investment made 
and mature Al 
capability in 
place 


Core Banking 


Account 
Management 





67.00 


Product 
Innovation & 
Pricing 


23.00 


Collateral 
Management 


16.00 





No Al 
applications 
and no Al 
capacity 


Customer 


Customer Service 


48.00 


Customer 
Acquisition 


46.00 


Segmentation & 
Insights 


32.00 


Channel 


Mobile Internet 
56.00 56.00 


Call Centre 
29.00 
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Corporate 


Functions Propositions 


Fraud Detection 
and Prevention 


62.00 





Technology 
Management 


54.50 





Clearance & 
Settlement 


33.33 


Compliance 
Documentation & 
Reporting 


33.00 


Talent 
Management 


31.00 


Risk Assessment 
& Monitoring Corporate Consumer 
27.00 Lending Financing 


26.00 26.00 


Investment & 
Securities 


20.33 
Wealth 
Management 
19.50 


Finance & 
Reporting 


13.00 








This heat map has been created based on banks’ responses to the four stages of adoption, namely ‘launched’, ‘planned 
to be launched’, ‘being explored and studied’, and ‘no plan to explore or study’ of 27 use cases identified in the survey. 
Each use case is mapped back to the 25 banking functions categorised by the team as follows. 
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Banking function Al use cases 


Core Banking — Account Management 


Al on the maintenance of bank account or to provide value * Client-facing Chatbots 
added services (e.g. cash management) to customer 


Core Banking — Product Innovation & Pricing 


Al to identity new product opportunities; or to support in * Financial product pricing 
product pricing to maximize value 


Core Banking — Collateral Management 


Al to improve effectiveness and efficiency in managing ° Collection information management 
collection 


Customer — Customer Services e Client-facing Chatbots 


Al to provide direct or indirect (supporting Al) customer ; ; 
e Sentiment analysis 


services 
Customer — Customer Acquisition e Personalised advertisement and recommendations 
Al to acquire new customers e Remote onboarding 


Customer — Segmentation & Insights e Customer churn prediction 


Al to develop better insights on what customer needs and * Personalised advertisement and recommendations 


their segmentation e Transaction Data Enrichment (TDE) 


Channel — Mobile 
Al applications in mobile banking channel 


Channel — Internet 
Al applications in internet banking channel 


Channel — Call Center 


Al applications to support call center operation e.g., 
identification, routing and management of in/outbound call, 
customer services, customer sentiment analysis, etc. 


Channel — Branch 


Al applications to support branch, priority banking, teller * ATM cash management 
and ATM operations 


e Anti-Money Laundering (AML) 


Corporate Functions — Fraud Detection and Prevention e Fraud detection 
Al in fraud prevention, detection and management e Know-Your-Customer (KYC) 


e Transaction Data Enrichment (TDE) 


Corporate Functions — Technology Management e Cyber security applications 


Al in technology management e.g., automatic coding, 
DevOps, power consumption management, etc. 


e Contract analyser 


e Intelligent identification system 


Corporate Functions — Clearance and Settlement 


Al in clearance and settlement operations to mitigate * Operational automation 
operational risk and / or increase efficiency e Settlement 


e Contract analyser 
Corporate Functions — Compliance Documentation & 


Reporting 
Al in compliance and regulatory reporting processes * Report generation using Natural Language Generation 


e Operational automation 


Banking function AI use cases 


Corporate Functions — Talent Management 


Al in talent acquisition, identification, development and/or * Surveillance system for employee conduct 


retention processes 


Corporate Functions — Risk Assessment and Monitoring 


Al in risk appetite setting, policy setting, risk assessment, 
monitoring and / or mitigation 


e Economic forecasting / nowcasting 


e Risk management 


Corporate Functions — Finance & Reporting 
Al in finance and reporting processes in Fls 


Report generation using Natural Language Generation 
(NLG) 


Corporate Functions — Payment 


Al in both performing payment transaction and innovation 
and collaboration with non-Fls in new business model 


Propositions — Corporate Lending 


Al in lending assessment, monitoring, fault management, 
pricing, etc. 


Propositions - Consumer Financing 


Al in consuming financing products, operations and 
management 


Propositions — Investment & Securities 


Alin investment advisory, asset management and securities 
trading services 


Algorithmic trading 
Automatic trade execution 


Economic forecasting / nowcasting 


Propositions — Wealth Management 


Al to in wealth management products, operations and 
management 


Capital optimisation and portfolio management 


Economic forecasting / nowcasting 


e Financial advice 


Transaction Data Enrichment (TDE) 


Propositions - Cash Management 


Alin cash management services provided to customers 


Propositions — Cards 
Al in managing cards business 


Propositions — Investment Banking 

Al in investment banking e.g., opportunities identification, 
risk identification, and management, trading, talent 
management, internal operation, etc. 


Nil 


The AI adoption score for each banking function has been calculated using weights associated with each of these adoption 
stages. The higher the score, the more use cases have been identified for that function and the more mature the Al 
capability. The following is the formula used to calculate the score. 






Planned to be launched 


Being explored and studied 


No plan to explore or study 


3 
2 
1 
0 


(Count of ‘launched’x3)+(Count of ‘planned to be launched’x2)+(Count of ‘being explored and studied’x1)+(Count of ‘no plan to 


explore or study’x0) 
Score= 


Total number of counts 
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Appendix C: Summary of 
local FinTech-related 
university programmes 


The purpose of this study is to investigate the Al talent shortage in Hong Kong. There were 29 FinTech related programmes 
in Hong Kong in 2019 (ten Bachelor’s and nineteen Master’s degree programmes). These courses are classified as FinTech- 
related because the overall content is heavily technology and finance-focused. Graduates are expected to have extensive 
knowledge in areas such as data science, business analytics and information systems, and possess technical capabilities 
in various software technologies and programming languages. 


No. of 
intakes / 
graduates 


Undergraduate/ 


Al/Fintech Course University Reference Link 


Graduate 









https://www.jupas.edu.hk/en/ 
: programme/cityu/JS1000/ 


: tational : 
i lls : a .., : City University of 
Finance and Financial : 
: Hong Kong 








i Undergraduate 

































































; l i “https: j .edu.hk/en/ 
Undergraduate : Intelligence: Systems : : 30 ps://www.jupas.edu.hk/en 
i i i f i : programme/cuhk/JS4468/ 
AS i and Technologies _ The Chinese — 
: University of Hong ' https;//www.sta.cuhk. 
: Kong ' edu.hk/Programmes/ 
duat BSc in Statisti i N/A i 
PH aoe : UndergraduateStudies/ 
LL ss,  BSCinStatistics.aspx 
' Bachel f Art i i 
i me _ Š n i i : https://www.jupas.edu.hk/en/ 
Undergraduate : Sciences in Applied : | | :15 : 
: — ì : The University of i : programme/hku/JS6224/ 
Le. Attificial Intelligence ona Kon MAL, EE YY O 
; deredek ' Bachelor of Science : iii - _https://saasweb.hku.hk/ 
A ae. Majorin Statisties) = programme/statphp 
: BBA Information ; ; ' https://www.jupas.edu.hk/en/ ` 
: : 1 i 
Undergraduate : Systems : THE Heng kong i 8 : programme/hkust/JS5314/ 
FFY RH FEE ETT TITTET PP TATTI Bn Data Science | University of Science A a a aarti 
Undergraduate : c TA PRISNER : and Technology N/A https://dsct.ust.hk/ 
cman AUD Technology EE I A 
' Bachelor of Science 
: : : https: „j : .h 
Undergraduate : (Honours) in Data Lingnan University 25 : ps {denen |Lpas edu.hk/en/ 
Do, : programme/lingnanu/JS7225/ 
OGen H—— OY  —O RR“ —————— — 
“Information Systems ` : 
i | ; | : https;//bba.hkbu.edu.hk/eng/ 
: and e-Business : Hong Kong Baptist | : | 
Undergraduate i A i : N/A : programmes/curriculum/isem. 
: Management : University Lig 
i Concentration E 
: BSc (Hons) Scheme : The Hong Kong ' https://www.jupas.edu.hk/en/ 
t i T i 
eee Computing Polytechnic University “° programme/polyu/JS3868/ 
; ; ; ' https://gs.hkbu.edu.h 
i MASB OU ARTE Hong Kong Baptist | ee — A : 
Graduate ' Finance (FinTech and: 9 g Bap i 50 i prog 


Financial Analytics) 


: University 


' msc-in-finance-fintech-and- 
: financial-analytics 





Undergraduate/ 
Graduate 


Graduate 


Master of Science in 


Al/Fintech Course University 


; Master of Science In 


Financial Technology: 
' The University 


| No. of 
intakes / 
graduates 


Reference Link 








l http://www.mscfintech.ust.hk/ 


' http//www.msba.ust.hk/ 
: program-courses/program- 








' https://seng.ust.hk/academics/ 
: taught-postgraduate/msc-bdt 





' http://fintech.erg.cuhk.edu. 
: hk/programme-information/ 
: objectives 








' https://www.sta.cuhk.edu.hk/ 
' Programmes/PostgraduateStud- : 
: ies/ZMScinDataScienceandBusi- : 
: nessStatistics.aspx#overview : 

















' https://www.scifac.hku.hk/ : 
| prospective/tpg/MDASC#About- - 
: the-programme 





: https://www.scifac.hku.hk/ 
: prospective/tpg/MStat 








Graduate | : : ' of Science and 
: Business Analytics i 
— ...-okŴF!7Ï 4... Technology 
' MSc in Big Dat 
Graduate ; CIV g ata 
: Technology 
Master of Science in: 
Graduate Financial Technolo : 
: gy : The Chinese 
Si CN Laas Tee ee 
: MSc in Data Science i: y 9 
i f : Kong 
Graduate : and Business ; 
Statistics 
i Master of Data : 
t : 
aie : Science : The University of 
GT Pren nen] Hong Kong 
Graduate Master of Statistics: 
Graduate Doctor of FinTech 
NII HA DDA ME rae aa 
Gradients c In Information 
E ——— Technology 
Graduate i MSc In E-Commerce The Hong Kong 
RE LA A TASE TN UDA TY Polytechnic University 
: MSc In Software i 
Graduate : 
nen echnology 
| Master of Science in 
Graduate : , : 
: Business Analytics 
: MSc Business and 
ae ' Data Analytics 
' MSc Busi 
Graduate i c Fa 
: Information Systems 
NGR UG GN DN wene 
' Business and 
Graduate K led ; 
; o ene Ê : City University of 
au RE DYF GWM YD AN lU HY Hong Kong 
' MSc Electroni ; 
Graduate i a 
: Commerce 
MG EEEE 5 
Graduate ' Mathematics and 
Statistics 
: Master of Science in 
Graduate i 







' Data Science 





: https://fb.polyu.edu.hk/dfintech/ | 
: programme/overview/ 


' https://www.comp.polyu.edu.hk/ : 
: en-us/programmes/detail/7 








' https://www.comp.polyu.edu.hk/ : 
: en-us/programmes/detail/5 








' https://www.comp.polyu.edu.hk/ : 
: en-us/programmes/detail/8 








' http//www51.polyu.edu.hk/ 

: eprospectus/tpg/2020/23090- 
: f-map 
: http://www.cb.cityu.edu.hk/ 
: postgrad/?category=Program- 
' me&admitProgCode=P84 
























: http://www.cb.cityu.edu.hk/ 
: postgrad/?category=Program- 
: me&admitProgCode=P05B 








: http://www.cb.cityu.edu.hk/ 
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